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POLYPEPTIDES AND POLYNUCLEOT IDES USEFUL FOR THE 
DIAGNOSIS AND TREATMENT OF PA THOGENIC NEISSERIA 



Technical Field 

The invention relates to materials and 
methodologies for managing the spread of infections 
caused by microorganisms having type 4 pilin, for 
example, Neisseria. More specifically, it relates to 
polypeptides and antibodies useful in vaccines for the 
treatment of pathologic infections caused by these 
microorganisms. It also relates to polynucleotides 
useful for the recombinant production of these 
polypeptides. In addition, it. relates to polypeptides, 
antibodies, and polynucleotides used for the detection of 
these strains. 

Background Art 

Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Bacteroides, and Pseudomonas. Species within these 
genuses which have pathogenic members that express type 4 
pilins are, for example, N. gonorrhoeae, N. meningitidis, 
M. bovis, B. nodosus, and P. aeruginosa. In addition, 
the Tcp pilin of V. cholerae is highly homologous to the 
type 4 pilins of other genuses. 
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The only known reservoir of the neisseriae is 
man. The genus includes two gram-negative species of 
pyogenic cocci that are pathogenic for man: the 
meningococcus (neisseria meningitidis) and the gonococcus 

5 {Neisseria gonorrheae) . 

N. Meningitidis causes a variety of infections, 
most notably, meningitis and bacteremia. Meningococci 
can be divided into serologic groups on the basis of 
agglutination reactions with immune serum. The present 

10 classification includes groups A through Z. Clinically 
significant new groups encompass Y and W 135. The major 
groups are remarkably heterogeneous, but subclassi- 
f ication with additional serologic markers has been 
possible. Noncapsular antigens have provided the basis 

15 for dividing strains of groups into distinct types. 

Meningococci cause either epidemic or sporadic 
disease, and historically, there has been a cyclic 
variation in the prevalence of meningococcal infection 
with peaks of increased frequency occurring every 8 to 12 

20 years and lasting 4 to 6 years. The attack rate of 

meningococcal disease is highest for children between 6 
months and 1 year. In the first half of this century, 
most epidemics of meningococcal disease in the United 
States were caused by group A organisms. In the past two 

25 decades, first group B then group C meningococci were 
responsible for outbreaks in both the military and 
civilian populations. Currently, group B is responsible 
for 50 to 55 percent of reported cases. 

Gonorrhea, which is caused by N. gonorrhea, is 

30 an infection of columnar and transitional epithelium. 
This disease is the most common reportable communicable 
disease in the United States, and also has world-wide 
prevalence . 



35 



WO 92/13871 



PCT/US92/00863 



-3- 

Although treatment of disease caused by 
gonococci and meningococci are often treated with 
antibiotics, these microorganisms often develop 
antibiotic resistance* Thus, prevention with vaccines is 
5 a preferable mode to contain the spread of ' infection. 
However, for a variety of reasons, including antigenic 
variation, the development of vaccines has been greatly 
hampered. For example, a vaccine which prevents 
gonorrhea is still lacking. In addition, although 56% of 

10 the causes of meningococcal disease are caused by 

serogroup B, an effective vaccine against this serogroup 
is also lacking. 

2f. gonorrheae and N. meningitidis are organisms 
completely adapted to the human host, having no other 

15 ecological niche. They have acquired a large arsenal of 
strategies to overcome the human host defense system. 

The first step in infection with pathological 
forms of these Neisseria is adherence to target cells. 
It is thought that the pili of these microorganisms are a 

20 major virulence factor. For example, it is known that in 
the case of 27. gronozxheae, piliated (P + ) variants attach 
much better to susceptible cells than non-piliated (P-) 
variants (Swanson, 1973; Pearce and Buchanan, 1978). 
Moreover, P+ variants, unlike P- variants, are able to 

25 establish an infection in human volunteers (Kellog et al, 
1968) . 

Although the pilus protein elicits an immune 
response, so many antigenic variants exist and continue 
to develop that vaccines against the pilus protein are 
30 not highly effective. 

Pilin is the major subunit of the pilus. 
• Expression of pilin is controlled at the pilE locus. 
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B-rif>f Description of t-ft* Tnvention 

We have isolated and characterized a novel 
protein of pathogenic forms of Neisseria, PilC, that may 
be is associated with the pili of gonococci and 
5 meningococci. We have also isolated and characterized 
genes which encode PilC, i.e., the pile loci. 

Portions of the DNA sequences of the pile genes 
are useful as probes to diagnose the presence of the 
relevant Neisseria in samples. These DNAs also make 
10 available polypeptide sequences of immunoreactive 

epitopes encoded within the loci, thus permitting the 
production of polypeptides which are useful as standards 
or reagents in diagnostic tests and/or as components of 
vaccines for microorganisms with type 4 pilin and 
15 containing one or more epitopes that are immunologically 
identifiable with an epitope encoded in pile of 
Neisseria. Antibodies, both monoclonal and purified 
polyclonal, directed against PilC epitopes are also 
useful for diagnostic tests and as therapeutic agents for 
passive immunization. In addition, by utilizing probes 
derived from the DNA sequences, it is possible to isolate 
and sequence portions of the pile loci from species and 

strains of interest. 

Accordingly, one embodiment of the invention is 
25 a recombinant polynucleotide encoding a polypeptide 

comprised of an immunoreactive epitope of a protein 

encoded in pilC of Neisseria. 

Another embodiment of the invention is a 

recombinant expression system comprising a polynucleotide 
30 encoding a polypeptide comprised of an immunoreactive 

epitope of a protein encoded in pilC of Neisseria, 

wherein the polynucleotide is operably linked to a 

control sequence compatible with a desired host. 
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Yet another embodiment of the invention is 
purified polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pilC of Neisseria. 

Another embodiment of the invention is a 
5 recombinant polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pilC of Neisseria. 

Still another embodiment of the invention is a 
vaccine composition for the treatment of Neisseria 
infection, comprised of a pharmaceutical^ acceptable 
10 excipient and of an effective amount of a recombinant 
polypeptide, wherein the polypeptide is comprised of an 
immunoreactive epitope of a protein encoded in pilC of 
Neisseria. 

Yet another embodiment of the invention is a 
15 composition comprised of purified polyclonal anti-PilC 
antibodies, wherein the PilC is of Neisseria. 

An additional embodiment of the invention is a 
composition comprised of a monoclonal antibody directed 
against an immunoreactive epitope encoded in pilC of 
20 Neisseria. 

Another embodiment of the invention is a method 
for producing antibodies to PilC of Neisseria comprising 
administering to an individual a composition comprised of 
an isolated immunogenic polypeptide containing a PilC 
25 epitope in an amount sufficient to produce an immune 
response . 

Yet another embodiment of the invention is an 
oligomer capable of hybridizing to a sequence in pilC of 
Neisseria, wherein the oligomer is comprised of a pilC 
30 sequence complementary to at least about 6 contiguous 
nucleotides of pilC. 

Still another embodiment of the invention is a 
process for detecting a pilC sequence in an analyte 
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strand, wherein the pilC sequence comprises a selected 
target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pilC 

5 sequence; 

(b) providing an oligomer capable of 
hybridizing to the target pile sequence, wherein the 
oligomer is comprised of a pilC targeting sequence 
complementary to at least about 6 contiguous nucleotides 

10 of pilC; 

(c) incubating the sample of (a) with the 
oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

15 (a) detecting hybrids formed between the 

target sequence, if any, and the oligomer. 

Yet another embodiment of the invention is a 
recombinant polynucleotide comprising a DNA sequence of 
at least 8 contiguous nucleotides from pilC, wherein the 

20 pilC sequence is selected from the group of sequences 
shown in Figure 3, Figure 6, and Figure 7. 

Another embodiment of the invention is a method 
of treating an individual for a Neisseria infection 
comprising administering to the individual antibodies 

25 produced according to claim 31, wherein the antibodies 
are administered in an amount effective to prevent the 
pathology of the infection. 

Brief Descr i ption of Hip Drawings 
30 Figure 1 is a genetic and physical map of pilC 

locus 1, showing the restriction enzyme sites. 

Figure 2 shows the nucleotide sequence and the 
deduced amino acid sequence of the 5' -end of pilCl. 
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Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
5 amino acids encoded therein. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
shift on the putative gene products encoded therein. 

Figure 6 shows the nucleotide sequences of PCR 
10 amplif ied fragments demonstrating a variation in the 

length of the G tract and sequence differences in the 5' 
region of the pile genes. 

Figure 7 shows the DNA sequence of the 3' -end 
of the pilC2 fragment. 
15 Figure 8 shows the pilC2 fragment sequence, and 

the putative amino acids encoded therein. 

Figure 9 shows a comparison of the analogous 
portions of pilC2 (top) and pilCl (bottom) DNA sequences, 
and the putative amino acids encoded therein. 

20 

Modes for Carrying Out the Invention 

The present invention provides polypeptides, 
antibodies, and polynucleotides which are useful for the 
detection and treatment of pathogenic microorganisms 

25 having type 4 pilin, for example, Neisseria, Moraxella, 
Bacteroides, and Pseudomonas. 

We have discovered a polypeptide, PilC, which 
is present in N. gonorrheae. This polypeptide is a 110 
kd protein that is closely associated with the pili of 

30 the microorganism. Most strains of N. gonorrheae carry 
two copies of the corresponding genes which encode the 
polypeptide (s) ; these genes have been denoted pilC. 
Expression from the pilC loci is regulated by frequent 
frameshif t mutations within a run of G residues in the 

35 
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region encoding the signal peptide. The two pilC genes 
of N. gonorrheae are not identical. Hence, alternate 
expression from either the pilCl or the pilC2 loci gives 
rise to two different forms of PilC. Among nonpiliated 
5 (P") descendants from P + clones, clones were found that 
expressed pilin but not PilC. All P + revertants from 
such PilC non-piliated clones have regained expression 
of PilC. Hence, phase variation of gonococcal pili can 
be caused by frameshift mutations in pilC. Transposon 
10 inactivation of the expressed pilC2 copy resulted in a 
nonpiliated, pilin producing revertible phenotype. It 
appears, therefore, that PilC is required for assembly of 
pilin subunits into a polymerized pilus fiber in N. 
gonorrheae . 

15 we have cloned and isolated gene, pllCl, from 

N. gonorrheae. In addition, by comparison of this gene 
sequence with a related sequence, we have cloned a 
fragment of the pil2 gene. Moreover, using 
polynucleotide probes derived from isolated pilCl and PCR 

20 amplification, we have detected two possible variants of 
a pilC gene in N. meningitidis. The sequences of pilC 
reported herein appear to be novel, in that there are no 
reported counterparts in Genbank, and no significant 
homologies were found with any available sequences in 

25 that data base. 

The useful materials and processes of the 
present invention are made possible by the provision of 
the sequences of the pilC genes from N. gonorrheae and 
from N. meningitidis. Information present in the 

30 sequences of the pilC genes allows for the design of 
polypeptides which may be useful as vaccines for 
treatment of pathogenic Neisseria, as diagnostic tools 
for the detection of these microorganisms, and as agents 
for the preparation of antibodies to PilC. In addition, 

35 
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this information allows for the design of polynucleotides 
for the recombinant production of the polypeptides 
derived from Pile, and for the design of oligomers which 
are useful as probes and primers for the detection and 
5 amplification of target regions of pilC. 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 

10 skill of the art. Such techniques are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL, Second Edition 
(1989) ; DNA CLONING, VOLUMES I AND II (D.N Glover ed. 
1985); OLIGONUCLEOTIDE SYNTHESIS (M.J, Gait ed, 1984); 

15 NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins 

eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & 
S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. 
Freshney ed. 1986) ; IMMOBILIZED CELLS AND ENZYMES (IRL 
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR 

20 CLONING (1984) ; the series, METHODS IN ENZYMOLOGY 
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR 
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, 
Cold Spring Harbor Laboratory) , Methods in Enzymology 
Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 

25 respectively), Mayer and Walker, eds. (1987), 

IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL 

30 IMMUNOLOGY, VOLUMES I-IV (D.M. Weir and C. C. Blackwell 
eds 1986). All patents, patent applications, and 
publications mentioned herein, both supra and infra, are 
hereby incorporated herein by reference. 
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As used herein, a polynucleotide "derived from" 
a designated sequence refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at 
least about 6 nucleotides, preferably at least about 8 
5 nucleotides, more preferably at least about 10-12 
nucleotides, and even more preferably at least about 
15-20 nucleotides corresponding to a region of the 
designated nucleotide sequence. "Corresponding" means 
homologous to or complementary to the designated 

10 sequence. Regions from which typical polynucleotide 
sequences may be "derived" include but are not limited 
to, for example, regions encoding specific epitopes, as 
well as non- transcribed and/or non- translated regions. 

The derived polynucleotide is not necessarily 

15 physically derived from the nucleotide sequence shown, 
but may be generated in any manner, including for 
example, chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, 
combinations of regions corresponding to that of the 

20 designated sequence may be modified in ways known in the 
art to be consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 

25 to that of a polypeptide encoded in the sequence, or a 
portion thereof wherein the portion consists of at least 
3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
acids, or which is immunologically identifiable with a 

30 polypeptide encoded in the sequence. This terminology 
also includes a polypeptide expressed from a designated 
nucleic acid sequence. 

A recombinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 

35 
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sequence. It may be generated in any manner, including 
for example, chemical synthesis, or expression of a 
recombinant expression system, or isolation from a 
microorganism. A recombinant or derived polypeptide may 
5 include one or more analogs of amino acids or unnatural 
amino acids in its sequence. Methods of inserting 
analogs of amino acids into a sequence are known in the 
art. It also may include one or more labels, which are 
known to those of skill in the art. 

10 The term "recombinant polynucleotide" as used 

herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with 
all or a portion of a polynucleotide with which it is 

15 associated in nature, (2) is linked to a polynucleotide 
other than that to which it is linked in nature, (3) does 
not occur in nature, or (4) is not in the form of a 
library. 

The term" "polynucleotide w as used herein refers 
20 to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term 
refers only to the primary structure of the molecule. 
Thus, this term includes double- and single- stranded DNA 
and RNA. It also includes known types of modifications, 
25 for example, labels which are known in the art, 

methylation, "caps", substitution of one or more of the 
naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates , 
30 phosphotriesters, phosphoamidates , carbamates, etc.) and 
with charged linkages (e.g., phosphorothioates , 
phosphorodithioates , etc.), those containing pendant 
moieties, such as, for example proteins (including for 
e.g., nucleases, toxins, antibodies, signal peptides, 
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poly-L-lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelators 
(e.g., metals, radioactive metals, boron, oxidative 
metals, etc.), those containing alkylators, those with 
5 modified linkages (e.g., alpha anomeric nucleic acids, 
etc.), as well as unmodified forms of the polynucleotide. 

The term "purified polynucleotide" refers to a 
polynucleotide which is essentially free, i.e., contains 
less than about 50%, preferably less than about 70%, and 
10 even more preferably less than about 90% of polypeptides 
with which the polynucleotide is naturally associated. 
Techniques for purifying polynucleotides from bacteria 
are known in the art, and include for example, disruption 
of the bacteria with a chaotropic agent, differential 
15 extraction and separation of the polynucleotide (s) and 
polypeptides by ion-exchange chromatography, affinity 
chromatography, and sedimentation according to density. 

The term "purified polypeptide" refers to a 
polypeptide or fragment thereof which is essentially 
20 free, i.e., contains less than about 50%, preferably less 
than about 70%, and even more preferably less than about 
90%, of cellular components with which the polypeptide is 
naturally associated. Techniques for purifying 
polypeptides are known in the art, and examples of these 
25 techniques are discussed infra. 

"Recombinant host cells", "host cells", 
"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 
30 which can be, or have been, used as recipients for 

recombinant vector or other transfer DMA, and include the 
progeny of the original cell which has been transfected. 
It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in 
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morphology or in genomic or total DNA complement as the 
original parent, due to natural, accidental, or 
deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
5 plasmid, a chromosome, a virus, a cosmid, etc. that 

behaves as an autonomous unit of polynucleotide replica- 
tion within a cell; i.e., capable of replication under 
its own control. 

A "vector" is a replicon in which another 
10 polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment . 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 

15 coding sequences to which they are ligated. The nature 
of such control sequences differs depending upon the host 
organism; in prokaryotes, such control sequences 
generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 

20 sequences include promoters, terminators and, in some 
instances, enhancers. The term "control sequences" is 
intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also 
include additional components whose presence is 

25 advantageous, for example, leader sequences. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended m a nn er. A 
control sequence "operably linked" to a coding sequence 

30 is ligated in such a way that expression of the coding 

sequence is achieved under conditions compatible with the 
control sequences. 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
5 into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5' -terminus and a translation stop codon at 
the 3 '-terminus. A coding sequence can include, but is 
10 not limited to mRNA, cDNA, and recombinant polynucleotide 
sequences . 

"Immunologically identifiable with/as" refers 
to the presence of epitope (s) and polypeptides (s) which 
are also present in the designated polypeptide (s) . 

15 immunological identity may be determined by antibody 

binding and/or competition in binding; these techniques 
are known to those of average skill in the art, and are 
also illustrated infra. 

As used herein, "epitope" refers to an 

20 antigenic determinant of a polypeptide. An epitope could 
comprise 3 amino acids in a spatial conformation which is 
unique to the epitope. Generally an epitope consists of 
at least 5 such amino acids, and more usually, consists 
of at least 8-10 such amino acids. Methods of 

25 determining the spatial conformation of amino acids are 
known in the art, and include, for example, x-ray 
crystallography and 2 -dimensional nuclear magnetic 
resonance . 

A polypeptide is "immunoreactive" when it is 
"immunologically reactive" with an antibody, i.e., when 
it binds to an antibody due to antibody recognition of a 
specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody 
binding, more particularly by the kinetics of antibody 
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binding, and/or by competition in binding using as 
competitor (s) a known polypeptide (s) containing an 
epitope against which the antibody is directed. The 
techniques for determining whether a polypeptide is 
5 immunologically reactive with an antibody are known in 
the art. An "immunoreactive" polypeptide may also be 
"immunogenic n . As used herein, the term "immunogenic 
polypeptide" is a polypeptide that elicits a cellular 
and/or humoral immune response, whether alone or linked 

10 to a carrier in the presence or absence of an adjuvant. 

As used herein, the term "antibody" refers to a 
polypeptide or group of polypeptides which are comprised 
of at least one antibody combining site. An "antibody 
combining site" or "binding domain" is formed from the 

15 folding of variable domains of an antibody molecule (s) to 
form three-dimensional binding spaces with an internal 
surface shape and charge distribution complementary to 
the features of an epitope of an antigen, which allows an 
immunological reaction with the antigen. An antibody 

20 combining site may be formed from a heavy and/or a light 
chain domain (VH and VL, respectively) , which form 
hypervariable loops which contribute to antigen binding. 
The term "antibody" includes, for example, vertebrate 
antibodies, hybrid antibodies, chimeric antibodies, 

25 altered antibodies, univalent antibodies, the Fab 
proteins, and single domain antibodies. 

As used herein, a "single domain antibody" 
(dAb) is an antibody which is comprised of an VH domain, 
which reacts immunologically with a designated antigen. 

30 A dAB does not contain a VL domain, but may contain other 
antigen binding domains known to exist in antibodies, for 
example, the kappa and lambda domains. Methods for 
preparing dABs are known in the art. See, for example, 
Ward et al. (1989) . 
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Antibodies may also be comprised of VH and VL 
domains, as well as other known antigen binding domains. 
Examples of these types of antibodies and methods for 
their preparation are known in the art (see, e.g., U.S. 
Patent No. 4,816,467, which is incorporated herein by 
reference) , and include the following. For example, 
"vertebrate antibodies" refers to antibodies which are 
tetramers or aggregates thereof, comprising light and 
heavy chains which are usually aggregated in a "Y" 
configuration and which may or may not have covalent 
linkages between the chains. In vertebrate antibodies, 
the amino acid sequences of all the chains of a 
particular antibody are homologous with the chains found 
in one antibody produced by the lymphocyte which produces 
15 that antibody in situ, or in vitro (for example, in 
hybridomas) . Vertebrate antibodies typically include 
native antibodies, for example, purified polyclonal 
antibodies and monoclonal antibodies. Examples of the 
methods for the preparation of these antibodies are 

20 described infra. 

"Hybrid antibodies" are antibodies wherein one 
pair of heavy and light chains is homologous to those in 
a first antibody, while the other pair of heavy and light 
chains is homologous to those in a different second anti- 
25 body. Typically, each of these two pairs will bind 

different epitopes, particularly on different antigens. 
This results in the property of "divalence", i.e., the 
ability to bind two antigens simultaneously. Such 
hybrids may also be formed using chimeric chains, as set 
30 forth below. 

"Chimeric antibodies", are antibodies in which 
the heavy and/or light chains are fusion proteins. 
Typically the constant domain of the chains is from one 
particular species and/or class, and the variable domains 
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are from a different species and/or class. Also included 
is any antibody in which either or both of the heavy or 
light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different 
5 sources, whether these sources be differing classes, or 
different species of origin, and whether or not the 
fusion point is at the variable/ constant boundary. Thus, 
it is possible to produce antibodies in which neither the 
constant nor the variable region mimic known antibody 

10 sequences. It then becomes possible, for example, to 
construct antibodies whose variable region has a higher 
specific affinity for a particular antigen, or whose 
constant region can elicit enhanced complement fixation, 
or to make other improvements in properties possessed by 

15 a particular constant region. 

Another example is "altered antibodies", which 
refers to antibodies in which the naturally occurring 
amino acid sequence in a vertebrate antibody has been 
varied. Utilizing recombinant DNA techniques, antibodies 

20 can be redesigned to obtain desired characteristics. The 
possible variations are many, and range from the changing 
of one or more amino acids to the complete redesign of a 
region, for example, the constant region. Changes in the 
constant region, in general, to attain desired cellular 

25 process characteristics, e.g., changes in complement 

fixation, interaction with membranes, and other effector 
functions. Changes in the variable region may be made to 
alter antigen binding characteristics. The antibody may 
also be engineered to aid the specific delivery of a 

30 molecule or substance to a specific cell or tissue site. 
The desired alterations may be made by known techniques 
in molecular biology, e.g., recombinant techniques, site 
directed mutagenesis, etc. 
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Yet anther example are "univalent antibodies", 
which are aggregates comprised of a heavy chain/light 
chain dimer bound to the Fc (i.e., constant) region of a 
second heavy chain. This type of antibody escapes 
5 antigenic modulation. See, e.g., Glennie et al. (1982). 

Included also within the definition of antibod- 
ies are "Fab" fragments of antibodies. The "Fab" region 
refers to those portions of the heavy and light chains 
which are roughly equivalent, or analogous, to the 

10 sequences which comprise the branch portion of the heavy 
and light chains, and which have been shown to exhibit 
immunological binding to a specified antigen, but which 
lack the effector Fc portion . "Fab" includes aggregates 
of one heavy and one light chain (commonly known as 

15 Fab' ) , as well as tetramers containing the 2H and 2L 
chains (referred to as F(ab) 2 ), which are capable of 
selectively reacting with a designated antigen or antigen 
family. "Fab" antibodies may be- divided into subsets 
analogous to those described above, i.e, "vertebrate 

20 Fab", "hybrid Fab", "chimeric Fab", and "altered Fab". 
Methods of producing "Fab" fragments of antibodies are 
known within the art and include, for example, 
proteolysis, and synthesis by recombinant techniques. 

The term "polypeptide" refers to a polymer of 

25 amino acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post-expression 
modifications of the polypeptide, for example, 

30 glycosylations, acetylations , phosphorylations and the 
like. Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, 
etc.), polypeptides with substituted linkages, as well as 
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other modifications known in the art, both naturally 
occurring and non- naturally occurring. 

"Transformation" , as used herein, refers to the 
insertion of an exogenous polynucleotide into a host 
5 cell, irrespective of the method used for the insertion, 
for example, direct uptake, transduction, f -mating or 
electroporation. The exogenous polynucleotide may be 
maintained as a non- integrated vector, for example, a 
plasmid, or alternatively, may be integrated into the 

10 host genome. 

"Treatment" as used herein refers to 
prophylaxis and/or therapy. 

By "immunogenic" is meant an agent used to 
stimulate the immune system of a living organism, so that 

15 one or more functions of the immune system are increased 
and directed towards the immunogenic agent. Immunogenic 
agents include vaccines. Immunogenic agents can be used 
in the production of antibodies, both isolated polyclonal 
antibodies and monoclonal antibodies, using techniques 

20 known in the art. 

By vaccine is meant an agent used to stimulate 
the immune system of a living organism so that protection 
against or amelioration of future harm is provided. 
Immunization refers to the process of inducing an 

25 increased level of antibodies and/or cellular immune 

response in which T- lymphocytes respond by killing the 
pathogen and/or activate other cells involved in the 
immune response pathway. The antibodies produced as a 
result of immunization may belong to any of the 

30 immunological classes, such as immunoglobulins A, D, E, G 
or M. 

An "individual", as used herein, refers to 
vertebrates, particularly members of the mammalian 
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species, and includes but is not limited to domestic 
animals, sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic 
acid contains the sequence that has sequence homology to 
5 that of mRNA. The "anti- sense strand" contains a 

sequence which is complementary to that of the "sense 
strand" . 

As used herein, the term "probe" refers to a 
polynucleotide which forms a hybrid structure with a 

10 sequence in a target region, due to complementarity of at 
least one sequence in the probe with a sequence in the 
target region. The polynucleotide regions of probes may 
be composed of DNA, and/or RNA, and/or synthetic 
nucleotide analogs. 

15 as used herein, the term "target region" refers 

to a region of the nucleic acid which is to be amplified 
and/or detected. The term "target sequence refers to a 
sequence with which a probe or primer will form a stable 
hybrid under desired conditions. 

20 The term "primer" as used herein refers to an 

oligomer which is capable of acting as a point of 
initiation of synthesis of a polynucleotide strand when 
placed under appropriate conditions. The primer will be 
completely or substantially complementary to a region of 

25 the polynucleotide strand to be copied. Thus, under 
conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like), the primer is 

30 extended by the polymerizing agent to form a copy of the 
analyte strand. The primer may be single -stranded, or 
alternatively may be partially or fully double- stranded. 

The terms "analyte polynucleotide" and 7 analyte 
strand" refer to a single- or double -stranded nucleic 
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acid molecule which is suspected of containing a target 
sequence, and which may be present in a biological 
sample. 

As used herein, a "biological sample" refers to 
5 a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, and 

10 also samples of in vitro cell culture constituents 
(including but not limited to conditioned medium 
resulting from the growth of cells in cell culture 
medium, putatively virally infected cells, recombinant 
cells, and cell components). 

15 As used herein, the term "oligomer" refers to 

primers and to probes. The term oligomer does not 
connote the size of the molecule. However, typically 
oligomers are no greater than 1000 nucleotides, more 
typically are no greater than 500 nucleotides, even more 

20 typically are no greater than 250 nucleotides; they may 
be no greater than 100 nucleotides, and may be no greater 
than 75 nucleotides, and also may be no greater than 50 
nucleotides in length. 

The term "coupled" as used herein refers to 

25 attachment by covalent bonds or by strong non-covalent 
interactions (e.g., hydrophobic interactions, hydrogen 
bonds, etc.). Covalent bonds may be, for example, ester, 
ether, phosphoester, amide, peptide, imide, carbon- sulfur 
bonds, carbon- phosphorus bonds, and the like. 

30 The term "support" refers to any solid or semi- 

solid surface to which a desired polypeptide or 
polynucleotide may be anchored. Suitable supports 
include glass, plastic, metal, polymer gels, and the 
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like, and may take the form of beads, wells, dipsticks, 
membranes , and the like. 

The term "label" as used herein refers to any 
atom or moiety which can be used to provide a detectable 
5 (preferably quantifiable) signal, and which can be 
attached to a polynucleotide or polypeptide. 

The term "type 4 pilin" as used herein refer to 
pilins that contain a conserved amino terminal 
hydrophobic domain beginning with an amino -terminal 

10 phenylalanine that is methylated upon processing and 

secretion of the pilin. Another characteristic feature 
of type 4 pilins is that in the propilin form they 
contain similar six- or seven- amino acid long leader 
peptides, which are much shorter than typical signal 

15 sequences. Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Ba.cteroid.es , and Pseudomonas. Species within these 
genuses which express type 4 pilins are, for example, N. 
gonorrhoeae, N. meningitidis, M. bovis, B. nodosus, and 

20 P. aeruginosa. As used herein, the term "type 4 pilin" 
also includes the Tcp pilin of Vibrio, (for example, V. 
cholerae) , that is highly homologous to the type 4 pilins 
of other genuses. Tcp pilin contains the characteristic 
amino- terminal hydrophobic domain as well as having a 

25 modified N- terminal amino acid that in this case may be a 
modified methionine because the Tcp pilin gene encodes a 
methionine residue at the position where all the others 
encode a phenylalanine. Precursor TcpA contains a much 
longer leader sequence than typical type 4 propilins but 

30 retains homology in the region surrounding the processing 
site. 

The term n pilC n as used herein refers to a gene 
encoding a polypeptide involved in the assembly of type 4 
pilin, which may also be required for attachment of the 
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pilin, and that is comprised of an epitope that is 
innrainologically identifiable with an epitope in PilC of 
N. gonorrhae or N. meningitidis. Included within this 
term is any homologous region from Vibrio, tcpC. 
5 As used herein the term n PilC n refers to a 

polypeptide encoded within pile, and includes TcpC of 
Vibrio. 

The description of the method to retrieve the 
DNA sequences is mostly of historical interest. The 

10 resultant sequences (and their complements) are provided 
herein, and the sequences, or any portion thereof, could 
be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
using methods similar to those described herein. 

15 The description infra, of "walking" the genome 

by isolating overlapping DNA sequences from the N. 
gonorrheae lambda gt-11 library and from an EMBL3 library 
provides one method by which DNAs corresponding to the 
pile genomes from, inter alia, N. gonorrheae and N. 

20 meningitidis, respectively, may be isolated. However, 

given the information provided herein, other methods for 
isolating pilC DNAs from these species, as well as from 
species of other genuses which have type 4 pilin are 
obvious to one of skill in the art. 

25 Characterization of the genes of the pilC loci 

has provided information on the polypeptides encoded 
therein, and on the control of their expression. 
Even though Type 4 pili have been extensively studied in 
several laboratories, little is known about their 

30 assembly. The presence of a specific assembly machinery 
for this class of pili is evident from the fact that the 
pilin gene of B. nodosus and M. bovis can be properly 
processed and assembled into a pilus in P. aeruginosa but 
not in E. coli (Ellerman et al., 1986; Mattick et al., 
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1987; Beard et al., 1990). Furthermore , the recent 
genetic characterization of TCP pill of Vibrio cholerae 
has revealed that a number of closely linked genes are 
required for pilin processing and assembly into a 
structure (Taylor et al., 1988). The TCP pilin does not 
carry an N-methylphenylalanine but its primary sequence 
is highly homologous to the Type 4 class of pilins. 

The N. gonorrhoeae pilus facilitates adherence 
of the bacterium to a number of eukaryotic cell types 
(Watt et al., 1980) and is thought to play a role in 
bacterial interaction with neutrophils (Fischer and Rest, 
1988) . The pilin is encoded from one or two pilE loci 
(Meyer et al., 1984; Swanson et al., 1986) which most 
likely each form a monocistronic operon. Hence, there 
15 have been no suggestions that genes closely linked to 
pilE are involved in pilus assembly. A dispersed 
location of genes involved in gonococcal pilus assembly 
as well as the rapid occurrence of nonpiliated variants 
generated via recombination with pilin sequences from 
20 silent loci, pilS, have made it extremely difficult to 
identify putative assembly genes for gonococcal pili. 

The PilC protein described herein is a protein 
encoded within a pilC or equivalent (for example, tcpC) 
locus or gene. In N. gonorrhoeae MS11 and most other 
25 gonococcal strains the PilC protein is expressed in small 
amounts. It is the only protein that is enriched in 
highly purified preparations of MS11 pili- PilC was not 
released from a nonpilated MSH (P'n) variant using the 
same procedure suggesting that this protein interacts 
30 with the polymerized pilus fiber. 

DNA sequence analysis of the cloned pilCl gene 
revealed one long open reading frame that was out of 
frame with its' putative AUG initiation codon and 5' end 
encoding the signal peptide. Minute amounts of PilC were 
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expressed in E. coli from pABJ04. Gel purified PilC from 
MSll contained a lysine residue in position four, whereas 
pilCl had a glutamine codon at this position. A lysine 
codon was, however, found at position four in a number of 
5 PCR amplified 5' pilC fragments suggesting that these 

fragments represent the 5' end of pilC2, which then must 
be ON in MSll. The finding that a miniTnCm insertion in 
pilC2 abolished PilC expression, whereas insertional 
inactivation of pilCl did not abolish PilC expression 

10 further argues that pilCl is translationally out of frame 
and pilC2 translationally in frame in the MSll variant we 
are studying. 

PCR amplified fragments of pilCl and pilC2 in 
MSll differed in the number of G residues found in the G 

15 tract. Only 11 or 12 Gs were found in pilCl clones 

(which would both generate an OFF phenotype) while 12 or 

13 Gs were found among pilC2 specific clones. Since 
pilC2 is the expressed gene in the MSll variant under 
study, we believe that this variant carries 13 Gs in 

20 pilC2 and 12 Gs in pilCl. The frequency of frameshift 
mutations in each locus is not known. However, the lack 
of 13 Gs among pilCl specific fragments and the lack of 

14 Gs among pilC2 specific fragments suggests that a 
deletion of one G residue occurs at a higher frequency 

25 than the insertion of one G residue. We had expected to 
find amplified fragments from N. gonorrhoeae containing 
10 G residues in the G tract, but found none in the 48 
clones sequenced. If only one G is added or deleted in 
each mutational event, the frequency of G tracts with 10 

30 residues should be low if G tracts normally are 12 or 13 
bp long. 

Frameshif ting in pilCl also occurred in 25. 
coli. In this case, however, two variants with 10 
residues were found out of 12 clones sequenced. It may 
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therefore be that there is a selection against in frame 
variants with 10 Gs in N. gonorrhoeae. A change from 
five glycines to four in the signal peptide may for 
example have an effect on the physical properties of the 
5 precursor form of PilC such that the signal peptide is 
not cleaved off. E. coli strain AA10 is recA. 
Therefore, frameshift mutations in the G tract of pilC 
occurs independent of the RecA protein. 

Translational frameshifting has been shown to 

10 regulate phase and antigenic variation of the gonococcal 
opacity protein PII that is encoded by a number of opa 
loci showing sequence variations. In this system a 
number of pentameric CTCTT repeats are present in the 
region encoding the signal peptide (Stern et al. 1986) . 

15 Variation in the number of repeats is independent of recA 
in N. gonorrhoeae as well as in E. coli (Murphy et al., 
1989) . Variation in the expression of lipopolysaccharide 
epitopes in Haemophilus influenzae was recently explained 
by translational frameshifting created by alterations in 

20 the number of CART repeats occurring in the 5' end of 
licA (Weiser et al., 1989). In Bordetella pertussis 
frameshift mutations in the regulatory vir locus occur in 
a run of C residues positioned internally in the gene 
(Stibitz et al., 1989). The C tract was in this case 

25 varying from 6 (in frame) to 7 residues (out of frame) . 
It is not known if this frameshift mutation is programed 
or not. The pilin gene of Bordetella pertussis was 
recently shown to be preceded by a stretch of Cs. 
Frequent mutations affecting the length of this C tract 

30 influenced the transcriptional activity of the pilin gene 
(Will ems et al., 1990). 

Variation in the number of the CTCTT repeats in 
opa genes was recently suggested to be due to 
recombination- independent slipped strand mispairing 
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(Murphy et al., 1989). Mispairing is thought to occur 
between strands subjected to local denaturation and 
should preferentially occur during replication. A number 
of unusual DNA structures (cruciform, Z form, H form) 
5 have been shown to form in vitro within a variety of 
specific DNA sequences. Under normal conditions the B 
form is the most favorable thermodynamically ( Frank - 
Kamenetskii and Vologodskii, 1984) . Transition to 
alternative conformations requires specific external 

10 conditions, supercoiling being the most physiologic. 
Single stranded (dG) n and (dC) n strands renature more 
slowly than complementary strands with arbitrary 
sequences, and methylation experiments suggest that a 
poly dG chain may form a hairpin- like structure 

15 stabilized by G-G bp (Panyutin et al., 1990) . 

Four variant sequences differing outside the G 
tract were obtained by PGR amplification of the 5' end of 
pile from four J7. gonorrhoeae strains. The region 5' of 
the G tract was invariant, as was the 3' end of the 

20 amplified region. All variation was confined to a region 
located 3' of the G tract. At least some of these 
sequence variations can be explained by mismatch pairing 
events. Thus, the addition of four nucleotides distal to 
the G tract in variant sequence 4 is possible to explain 

25 by a two step mispairing event occurring within variant 
sequence 2. Slip strand mispairing between the two CA 
residues in - GGCGCAGGCGCA- would yield 
-GGCGCAGGCGCAGGCGCA- . A second mispairing event 
occurring between the two C- residues at positions 3 and 5 

30 gives rise to the sequence -GGCAGGCGCAGGCGCA- present in 
variant 4. It may therefore be that a sequence close to 
a poly(G) tract is prone to slipped strand mispairing. 

Gonococcal pilus phase variation is associated 
with an altered nucleotide sequence of pilE via 



WO 92/13871 



PCT/US92/00863 



-28- 



10 



recombinations with silent pilS sequences (Haas and 
Meyer, 1986, Swanson et al., 1986). An irreversible 
switch OFF in pilus expression results from deletions of 
the 5' coding and control regions of the pilE locus 
(Swanson et al., 1985). Reversible gonococcal pilus 
phase variation is associated with nucleotide changes in 
pilE resulting in an altered pilin product. It has been 
suggested that the pilins of these variants are assembly 
defective (Bergstrom et al., 1986; Swanson et al., 1986; 
Hill et al., 1990). Here we present evidence that switch 
OFF and ON of PilC expression causes pilus expression to 
phase vary. Five out of five P", pilin producing 
descendants from MSll^ (P + , PilC + ) that expressed pilin 
did not express PilC. All tested P + revertants from the 
15 five P", PilC" variants had regained expression of PilC. 
The pilin of one nonpiliated PilC OFF-switcher (variant 
8) differed by eight amino acids from that of the parent. 
The fact that one piliated PilC* backswitcher (8:1) 
expressed a pilin identical in sequence to the 
20 nonpiliated variant (8) strongly suggests that the 

regained expression of pili is due to an ON- switch in 
PilC expression. The above results also imply that the 
nonpiliated phenotype of variant 8 is not due to the 
alterations in the pilin relative to the parental strain 
25 but to an OFF-switch of PilC. The finding that mTnCm 
insertions resulted in P + colonies when inserted into 
pilCl and P" colonies when inserted into the actively 
expressing p±lC2 locus offers further evidence that PilC 
is essential for the biogenesis of gonococcal pili. P , 
pilC2i :mTnCm-12 insertion mutants reverted to P + colony 
morphology at a low frequency. These revertants most 
likely represent f rameshif ting mutants in pilCl resulting 
in expression of PilC from this locus. A double mutant 
in pilCl and pilC2 was stably nonpiliated, expressed 
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pilin, but expressed amounts of pilin that did not 
express any pili when examined by transmission electron 
microscopy. It is therefore believed that out of frame 
mutations of both pilCl and pilC2 will abolish pili 
5 formation. 

At this stage we cannot exclude the possibility 
that some PilC" variants from MSII mk (P + # PilC") are 
generated by transformation of pilCl sequences and 
homologous recombination with pilC2 thus generating 

10 variants with two pilCl 5' ends at both pilC loci. PilC + 
revertants from PilC" clones must, however, all be due to 
frameshift mutations in either pilCl or pilC2. 

We propose that PilC forms an outer membrane 
pore or assembly center enabling the pilin subunits to be 

15 assembled and translocated across the outer membrane 

analogous to the proposed function of the high molecular 
weight proteins required for the assembly of 
enterobacterial pili (the latter of which is discussed in 
Norgren et al., 1987). Alternatively, PilC may act as an 

20 initiator for polymerization. In the latter case PilC 
would be expected to be located at the tip of the 
polymerized pilus. 

It is possible that the alternate expression of 
PilC from two structurally different pilC loci is yet 

25 another example of antigenic variation in Neisseria 
gonorrhoeae. It is, however, possible that this 
variation could have functional implications as well. 
Each class of E. coli pili utilizes a different outer 
membrane pore/assembly protein. Hence, pilin subunits 

30 and/or periplasmic chaperone complexes may specifically 
interact with an exposed region of the protein allowing 
polymerization of pilus subunit proteins. The repertoire 
of antigenic variants of gonococcal pilins is vast 
(Hagblom et al., 1985). It may be that only certain 
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pilin variants are assembled via PilCl and PilC2 
respectively. Alternatively, if PilC acts as an 
initiator it could also possess other properties such as 
being involved in Pilus mediated attachment. 
5 In one embodiment of the invention, 

immunogenically active polypeptides encoded within pilC 
are prepared. The availability of pilC DNA sequences, 
either those isolated by utilizing the DNA sequences 
described in the Examples, or nucleotide sequences 
10 derived therefrom (including segments and modifications 
of the sequence) , permits the construction of expression 
vectors encoding immunologically reactive regions of the 
polypeptide encoded in either strand. Immunological 
reactivity may be determined by immunoassay using 
15 antibodies raised to PilC. Fragments encoding the 
desired polypeptides are derived from the DNA clones 
using conventional restriction digestion or by synthetic 
methods, and are ligated into vectors which may, for 
example, contain portions of fusion sequences such as 
20 beta-galactosidase or superoxide dismutase (SOD) , prefer- 
ably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences 
of SOD are described in European Patent Office 
Publication number 0196056, published October 1, 1986. 
25 Any desired portion of the pilC DNA containing an open 

reading frame, in either sense strand, can be obtained as 
a recombinant polypeptide, such as a mature or fusion 
protein; alternatively, a polypeptide encoded in the DNA 
can be provided by chemical synthesis. 
30 The DNA encoding the desired polypeptide, 

whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be 
ligated into expression vectors suitable for any 
convenient host. Both eukaryotic and prokaryotic host 
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polypeptides, and a summary of some of the more common 
control systems and host cell lines is given infra. The 
polypeptide is then isolated from lysed cells or from the 
5 culture medium and purified to the extent needed for its 
intended use. Purification may be by techniques known in 
the art, for example, differential extraction, salt 
fractionation, chromatography on ion exchange resins, 
affinity chromatography, centrifugation, and the like. 

10 See, for example, Methods in Enzymology for a variety of 
methods for purifying proteins. Such polypeptides can be 
used as diagnostics, or those which give rise to 
neutralizing antibodies may be formulated into vaccines. 
Antibodies raised against these polypeptides can also be 

15 used as diagnostics, or for passive immunotherapy. 

The PilC antigens may also be isolated from 
meningococci and from gonococci. The bacteria may be 
grown by conditions known in the art, some of which are 
described infra. In addition, a method for isolating 

20 PilC from gonococci is described infra. 

In another embodiment of the invention, the 
immunoreactive polypeptides may be conjugated with 
carrier. An antigenic region of a polypeptide is gener- 
ally relatively small- -typically 8 to 10 amino acids or 

25 less in length. Fragments of as few as 5 amino acids may 
characterize an antigenic region. These segments may 
correspond to regions of PilC antigen. Accordingly, 
using the DNAs of pile as a basis, DNAs encoding short 
segments of PilC polypeptides can be expressed 

30 recombinantly either as fusion proteins, or as isolated 
polypeptides. In addition, short amino acid sequences 
can be conveniently obtained by chemical synthesis. In 
instances wherein the synthesized polypeptide is . 
correctly configured so as to provide the correct 
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epitope, but is too small to be immunogenic, the 
polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such link- 
age are known in the art, including the formation of 

5 disulfide linkages using 

N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP) and 

succinimidyl 

4- (N-maleimido-methyl)cyclohexane-l-carboxylate (SMCC) 
obtained from Pierce Company, Rockford, Illinois, (if the 

10 peptide lacks a sulfhydryl group, this can be provided by 
addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
residues on one protein and an amide linkage through the 
epsilon-amino on a lysine, or other free amino group in 

15 the other. A variety of such disulfide/amide- forming 
agents are known. See, for example, Immun. Rev. (1982) 
£2:185. Other bifunctional coupling agents form a 
thioether rather than a disulfide linkage. Many of these 
thio- ether- forming agents are commercially available and 

20 include reactive esters of 6-maleimidocaproic acid, 
2-bramoacetic acid, 2-iodoacetic acid, 
4- (N-maleimido-methyl)cyclohexane-l-carboxylic acid, and 

the like. The carboxyl groups can be activated by 

combining them with succinimide or 
25 l-hydroxyl -2 -nitro-4- sulfonic acid, sodium salt. 

Additional methods of coupling antigens employs the 

rotavirus/ "binding peptide" system described in EPO Pub. 

No. 259,149, the disclosure of which is incorporated 

herein by reference. The foregoing list is not meant to 
30 be exhaustive, and modifications of the named compounds 

can clearly be used. 

Any carrier may be used which does not itself 

induce the production of antibodies harmful to the host. 

Suitable carriers are typically large, slowly metabolized 
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macromolecules such as proteins; polysaccharides, such as 
latex functionalized sepharose, agarose, cellulose, cel- 
lulose beads and the like; polymeric amino acids, such as 
polyglutamic acid, poly lysine, and the like; amino acid 
5 copolymers; and inactive virus particles or attenuated 
bacteria of other strains, for example, those of 
Salmonella. Especially useful protein substrates are 
serum albumins, keyhole limpet hemocyanin, immunoglobulin 
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and 

10 other proteins well known to those skilled in the art. 

In addition to full-length PilC proteins, 
polypeptides comprising truncated PilC amino acid 
sequences encoding at least one immunologically reactive 
epitope are useful immunological reagents. For example, 

15 polypeptides comprising such truncated sequences can be 
used as reagents in an immunoassay. These polypeptides 
also are candidate subunit antigens in compositions for 
antiserum production or vaccines. While these truncated 
sequences can be produced by various known treatments of 

20 native bacterial protein, it is generally preferred to 
make synthetic or recombinant polypeptides comprising a 
PilC sequence • Polypeptides comprising these truncated 
PilC sequences can be made up entirely of PilC sequences 
(one or more epitopes, either contiguous or 

25 noncontiguous) , or PilC sequences and heterologous 
sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the PilC epitope (s), or facilitate the 

30 coupling of the polypeptide to an immunoassay support or 
a vaccine carrier. See, e.g., EPO Pub. No. 116,201; 
U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. 
Pat. No. 4,629,783, the disclosures of which are 
incorporated herein by reference. 
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The size of polypeptides comprising the 
truncated Pile sequences can vary widely, the minimum 
size being a sequence of sufficient size to provide an 
immunologically reactive PilC epitope, while the maximum 
5 size is not critical. For convenience, the maximum size 
usually is not substantially greater than that required 
to provide the desired PilC epitopes and function (s) of 
the heterologous sequence, if any. Typically, the 
truncated PilC amino acid sequence will range from about 

10 5 to about 100 amino acids in length. More typically, 

however, the PilC sequence will be a maximum of about 50 
amino acids in length, preferably a maximum of about 30 
amino acids. It is usually desirable to select PilC 
sequences of at least about 10, 12 or 15 amino acids, up 

15 to a maximum of about 20 or 25 amino acids. 

Truncated PilC amino acid sequences comprising 
epitopes can be identified in a number of ways. For 
example, the entire PilC protein sequence can be screened 
by preparing a series of short peptides that together 

20 span the entire protein sequence. By starting with, for 
example, lOOmer polypeptides, it would be routine to test 
each polypeptide for the presence of epitope (s) showing a 
desired reactivity, and then testing progressively 
smaller and overlapping fragments from an identified 

25 lOOmer to map the epitope of interest. Screening such 
peptides in an immunoassay is within the skill of the 
art. It is also known to carry out a computer analysis 
of a protein sequence to identify potential epitopes, and 
then prepare oligopeptides comprising the identified 

30 regions for screening. 

In another embodiment of the invention, the 
immunogenicity of the epitopes of PilC may also be 
enhanced by preparing them in mammalian or yeast systems 
fused with or assembled with particle- forming proteins 
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such as, for example, that associated with hepatitis B 
surface antigen. See, e.g., U.S. Pat. No. 4,722,840. 
Constructs wherein the PilC epitope is linked directly to 
the particle- forming protein coding sequences produce 
5 hybrids which are immunogenic with respect to the PilC 
epitope. In addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 

10 forming protein which include PilC sequences are 

immunogenic with respect to the microorganism encoding 
the PilC epitope (for example, Neisseria. , Vibrio, 
Moraxella, Bacteroides, or Pseudomonas) and HBV. 

Hepatitis surface antigen (HBSAg) has been 

15 shown to be formed and assembled into particles in 5. 

cerevisiae (Valenzuela et al. (1982)), as well as in, for 
example, mammalian cells (Valenzuela, P., et al. (1984)). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 

20 constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al. (1984) . Constructs of 
the pre-S-HBSAg particle expressible in yeast are 
disclosed in EPO 174,444, published March 19, 1986; 

25 hybrids including heterologous viral sequences for yeast 
expression are disclosed in EPO 175,261, published March 
26, 1966. These constructs may also be expressed in 
mammalian cells such as Chinese hamster ovary (CHO) cells 
using an SV40-dihydrof olate reductase vector (Michelle et 

30 al. (1984)). 

In addition, portions of the particle -forming 
protein coding sequence may be replaced with codons 
encoding a PilC epitope. In this replacement, regions 
which are not required to mediate the aggregation of the 
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units to form immunogenic particles in yeast or mammals 
can be deleted, thus eliminating additional HBV antigenic 
sites from competition with the PilC epitope. 

In another embodiment of the invention, the 
5 immunoreactive polypeptides encoded in pile are prepared 
into vaccines. Vaccines may be prepared from one or more 
immunogenic polypeptides derived from pile. If 
recombinant, these polypeptides may be expressed in 
various host cells (e.g., bacteria, yeast, insect, or 

10 mammalian cells) , or alternatively may be isolated from 
the bacterial preparations. In addition to the above, it 
is also possible to prepare live vaccines of attenuated 
microorganisms which express one or more recombinant 
polypeptides derived from the pile gene. Suitable 

15 attenuated microorganisms are known in the art and 

include, for example, viruses (e.g., vaccinia virus (see 
Brown et al. (1986)), as well as bacteria. 

The preparation of vaccines which contain an 
immunogenic polypeptide (s) as active ingredients, is 

20 known to one skilled in the art. Typically, such 

vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solu- 
tion in, or suspension in, liquid prior to injection may 
also be prepared. The preparation may also be 

25 emulsified, or the protein encapsulated in liposomes. 

The active immunogenic ingredients are often mixed with 
excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitable 
excipients are, for example, water, saline, dextrose, 

30 glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the vaccine may contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples 
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of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide, N- acetyl -muramyl- 
L-threonyl-D-isoglutamine (thr-MDP) , N-acetyl-nor- 
muranyl-L-alanyl-D-isoglutamine (CGP 11637, referred to 
5 as nor-MDP) , N-acetylmuramyl-L-alanyl-D-isoglutaminyl- 

L -alanine- 2- (1' -2' -dipalmitoyl-sn-glycero-3-hydroxyphosph 
oryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE) , 
and RIBI, which contains three components extracted from 
bacteria, monophosphoryl lipid A, trehalose dimycolate 

10 and cell wall skeleton (MPL+TDM+CWS) in a 2% 

squalene/Tween 80 emulsion. The effectiveness of an 
adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing a PilC immunoreactive sequence resulting from 

15 administration of this polypeptide in vaccines which are 
also comprised of the various adjuvants. 

The vaccines are conventionally administered 
parenterally, by injection, for example, either 
subcutaneous ly or intramuscularly. Additional 

20 formulations which are suitable for other modes of 

administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 
binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories 

25 may be formed from mixtures containing the active 

ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 
excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium 

30 saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, suspen- 
sions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10%- 95% of active . 
ingredient, preferably 25%- 70%. 
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The proteins may be formulated into the vaccine 
as neutral or salt forms. Pharmaceutical^ acceptable 
salts include the acid addition salts (formed with free 
amino groups of the peptide) and which are formed with 
5 inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, 

10 ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine, 2-ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 

15 amount as will be prophylactically and/or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 micrograms to 250 micrograms 
of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to 

20 synthesize antibodies, and the degree of protection 

desired. Precise amounts of active ingredient required 
to be administered may depend on the judgment of the 
practitioner and may be peculiar to each subject. 

The vaccine may be given in a single dose 

25 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and or reenforce the immune 

30 response, for example, at 1-4 months for a second dose, 
and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at least in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 
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In addition, the vaccine containing the im- 
munogenic antigen (s) derived from pile may be 
administered in conjunction with other immunoregulatory 
agents, for example, immune globulins. 
5 Another embodiment of the invention are 

antibodies which react immunologically with PilC 
epitopes. The immunogenic polypeptides prepared as 
described above are used to produce antibodies, including 
polyclonal and monoclonal. If polyclonal antibodies are 

10 desired, a selected mammal (e.g., mouse, rabbit, goat, 

horse, etc.) is immunized with an immunogenic polypeptide 
bearing a PilC epitope (s). Serum from the immunized 
animal is collected and treated according to known 
procedures. If serum containing polyclonal antibodies to 

15 a PilC epitope (i*e., an epitope encoded within pilC) 
contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaf f inity 
chromatography. Techniques for producing and processing 
polyclonal antisera are known in the art, see for 

20 example, Mayer and Walker (1987) . 

Monoclonal antibodies directed against PilC 
epitopes can also be readily produced by one skilled in 
the art. The general methodology for making monoclonal 
antibodies by hybridomas is well known. Immortal 

25 antibody-producing cell lines can be created by cell 
fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. See, e.g., M. 
Schreier et al. (1980); Hammerling et al. (1981); Kennett 

30 et al. (1980); see also . U.S. Patent Nos. 4,341,761; 

4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 
4,491,632; and 4,493,890. Panels of monoclonal 
antibodies produced against PilC epitopes can be screened 
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f or various properties; i.e., for isotype, epitope affin- 
ity, etc. 

Antibodies, both monoclonal and polyclonal, 
which are directed against PilC epitopes are particularly 
5 useful in diagnosis, and those which are neutralizing are 
useful in passive immunotherapy. Methods for introducing 
antibodies into an individual to accomplish passive 
immunotherapy are known in the art. In addition, 
monoclonal antibodies, in particular, may be used to 

10 raise ant i-idio type antibodies. 

Anti-idiotype antibodies are immunoglobulins 
which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. 
See, for example, Nisonoff, A., et al. (1981) and 

15 Dreesman et al. (1985) . Techniques for raising 

anti-idiotype antibodies are known in the art. See, for 
example, Grzych (1985), MacNamara et al. (1984), and 
Uytdehaag et al. (1985) . These anti-idiotype antibodies 
may also be useful for treatment, vaccination and/or 

20 diagnosis of the relevant microorganism encoding the 
antigen of interest, (for example, Ifeisseria, 
Pseudomonas, Moraxella, Bacteroides, or Vibrio) as well 
as for an elucidation of the immunogenic regions of PilC. 
Another embodiment of the invention concerns 

25 immunoassays and diagnostic kits. The polypeptides which 
contain epitopes encoded in pile which are immunoreactive 
with anti-PilC antibodies in biological samples are 
useful in immunoassays to detect presence of anti-PilC 
antibodies, or the presence of the relevant microorganism 

30 or its antigens in biological samples. Design of the 

immunoassays is subject to a great deal of variation, and 
many formats are known in the art. The immunoassay will 
utilize a polypeptide comprised of at least one epitope 
derived from PilC or encoded in pilC. In one embodiment, 
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the immunoassay uses a combination of epitopes including 
the one derived from PilC or encoded in pile. These 
epitopes may be derived from the same or from different 
polypeptides, and may be in separate recombinant or 
5 natural polypeptides, or together in the same recombinant 
polypeptides* An immunoassay may use, for example, a 
monoclonal antibody directed towards an epitope (s), a 
combination of monoclonal antibodies directed towards 
epitopes of one antigen, monoclonal antibodies directed 

10 towards epitopes of different antigens, polyclonal 
antibodies directed towards the same antigen, or 
polyclonal antibodies directed towards different 
antigens. Protocols may be based, for example, upon 
competition, or direct reaction, or sandwich type assays. 

15 Protocols may also, for example, use solid supports, or 
may be by immunoprecipitation. Most assays involve the 
use of labeled antibody or polypeptide; the labels may 
be, for example, enzymatic, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays 

20 which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 
such as ELISA assays. 

Typically, em immunoassay for an anti-PilC 

25 antibody (s) will involve selecting and preparing the test 
sample suspected of containing the antibodies, such as a 
biological sample, then incubating it with an 
imiminoreactive (also called antigenic) polypeptide (s) 
containing at least one epitope encoded in pile. The 

30 incubation is under conditions that allow 

antigen- antibody complexes to form. Suitable incubation 
conditions are well known in the art. Subsequent to the 
incubation, complexes which are formed which contain the 
imiminoreactive polypeptide are detected. The immunoassay 
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may be, without limitations, in a heterogenous or in a 
homogeneous format, and of a standard or competitive 
type. 

In a heterogeneous format, the polypeptide is 
5 typically bound to a solid support to facilitate separa- 
tion of the sample from the polypeptide after incubation. 
Examples of solid supports that can be used are nitro- 
cellulose (e.g., in membrane or microtiter well form), 
polyvinyl chloride (e.g., in sheets or microtiter wells), 

10 polystyrene latex (e.g., in beads or microtiter plates, 
polyvinylidine fluoride (known as Immulon) , diazotized 
paper, nylon membranes, activated beads, and Protein A 
beads. For example, Dynatech Immulon 1 or Immulon 
microtiter plates or 0.25 inch polystyrene beads (Preci- 

15 sion Plastic Ball) can be used in the heterogeneous 
format. The solid support containing the antigenic 
polypeptide is typically washed after separating it from 
the test sample, and prior to detection of bound antibod- 
ies. Both standard and competitive formats are known in 

20 the art. 

In a homogeneous format, the test sample is 
incubated with antigen in solution. For example, it may 
be under conditions that will precipitate any 
antigen- antibody complexes which are formed. Both 

25 standard and competitive formats for these assays are 
known in the art. 

In a standard format, the amount of anti-PilC 
antibodies forming the antibody-antigen complex is 
directly monitored. This may be accomplished by 

30 determining whether labeled anti -xenogeneic (e.g., 

anti-human) antibodies which recognize an epitope on 
anti _PilC antibodies will bind due to complex formation. 
In a competitive format, the amount of anti-PilC 
antibodies in the sample is deduced by monitoring the 
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competitive effect on the binding of a known amount of 
labeled antibody (or other competing ligand) in the 
complex. 

Complexes formed comprising anti-PilC antibody 
5 (or, in the case of competitive assays, the amount of 
competing antibody) are detected by any of a number of 
known techniques, depending on the format. For example, 
unlabeled anti-PilC antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed 

10 with a label, (e.g., an enzyme label). 

In immunoassays where Pile polypeptides are the 
analyte, the test sample, which may be a biological 
sample, is incubated with anti-PilC antibodies under 
conditions that allow the formation of antigen- antibody 

15 complexes. Various formats can be employed. For 
example, a "sandwich assay" may be employed, where 
antibody bound to a solid support is incubated with the 
test sample; washed; incubated with a second, labeled 
antibody to the analyte, and the support is washed again. 

20 Analyte is detected by determining if the second antibody 
is bound to the support. In a competitive format, which 
can be either heterogeneous or homogeneous, a test sample 
is usually incubated with antibody and a labeled, 
competing antigen is also incubated, either sequentially 

25 or simultaneously. These and other formats are well 
known in the art. 

The antigenic regions of the polypeptides 
encoded in pile can be mapped and identified by screening 
the antigenicity of expression products of pilC DNAs 

30 which encode portions of the PilC. The expression 

products may be from a variety of expression systems, 
including, for example bacterial systems, yeast systems, 
insect systems, and eukaryotic cell systems. In 
addition, studies giving rise to an antigenicity index 
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and hydrophobicity/hydrophilicity profile give rise to 
information concerning the probability of a region's 
antigenicity. 

Efficient detection systems for infection with 
5 pathogenic microorganisms, (for example, Neisseria, 
Pseudomonas, Bacteroides, Moraxella, or Vibrio) may 
include the use of panels of epitopes. The epitopes in 
the panel may be constructed into one or multiple 
polypeptides. At least one of the epitopes will be 

10 encoded in pilC or derived from PilC. The assays for the 
varying epitopes may be sequential or simultaneous. 

Kits suitable for immunodiagnosis and contain- 
ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 

15 polypeptides of the invention containing PilC epitopes 
(i.e., epitopes encoded within pilC) or antibodies 
directed against PilC epitopes in suitable containers. 
The kit may also contain other reagents, for example, 
buffer and standard, as well as other materials required 

20 for the conduct of the assay, as well as a suitable set 
of instructions for conducting the assay using the kit 
materials . 

Another embodiment of the invention are 
oligomers. Using the disclosed portions of the pilC DNAs 

25 as a basis, oligomers of approximately 8 nucleotides or 
more can be prepared, either by excision from recombinant 
polynucleotides or by synthetic methods which are known 
in the art. These oligomers can serve as probes for the 
detection (including isolation and/or labeling) of 

30 polynucleotides which contain pilC sequences, and/or as 
primers for the transcription and/or replication of 
targeted pilC sequences. The oligomers contain a 
targeting polynucleotide sequence, which is comprised of 
nucleotides which are complementary to a target pilC 
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nucleotide sequence; the sequence is of sufficient length 
and complementarity with the pilC sequence to form a 
duplex which has sufficient stability for the purpose 
intended. For example, if the purpose is the isolation, 
5 via immobilization, of an analyte containing a target 
pilC sequence, the oligomers would contain a 
polynucleotide region which is of sufficient length and 
complementarity to the targeted pilC sequence to afford 
sufficient duplex stability to immobilize the analyte on 

10 a solid surface, via its binding to the oligomers, under 
the isolation conditions. For example, also, if the 
oligomers are to serve as primers for the transcription 
and/or replication of target pilC sequences in an analyte 
polynucleotide, the oligomers would contain a 

15 polynucleotide region of sufficient length and 

complementarity to a region flanking the targeted pilC 
sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex 
form with the target sequence, under the polymerizing 

20 conditions. The oligomers may contain a minimum of about 
4 contiguous nucleotides which are complementary to a 
targeted pilC sequence; usually the oligomers will 
contain a minimum of about 8 contiguous nucleotides which 
are complementary to the targeted pilC sequence, and 

25 preferably will contain a minimum of about 14 contiguous 
nucleotides which are complementary to the targeted pilC 
sequence . 

The oligomer, however, need not consist only of 
the sequence which is complementary to the targeted pilC 
30 sequence. It may contain in addition, nucleotide 

sequences or other moieties which are suitable for the 
purposes for which the oligomers are used. For example, 
if the oligomers are used as primers for the 
amplification of targeted pilC sequences via the 
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polymerase chain reaction (PCR) , they may contain 
sequences which, when in duplex, form restriction enzyme 
sites which facilitate the cloning of the amplified 
sequences. Other types of moieties or sequences which 
5 are useful of which the oligomers may be comprised or 
coupled to, are those which are known in the art to be 
suitable for a variety of purposes, including the 
labeling of nucleotide probes. 

In the basic nucleic acid hybridization assay, 

10 single-stranded analyte nucleic acid (either DNA or RNA) 
is hybridized to a nucleic acid probe, and resulting 
duplexes are detected. The probes for pllC sequences 
(natural or derived) are a length which allows the 
detection of these sequences by hybridization. While 

15 6-8 nucleotides may be a workable length, sequences of 

10-12 nucleotides are preferred, and about 20 nucleotides 
or more appears optimal. Preferably, these sequences 
will derive from regions which lack heterogeneity. These 
probes can be prepared using routine methods, including 

20 automated oligonucleotide synthetic methods. For use as 
probes, complete complementarity is desirable, although 
it may be unnecessary as the length of the fragment is 
increased. 

For use of such probes as agents to detect the 
25 presence of pilC sequences, the sample to be analyzed 

(which may be biological) may be treated, if desired, to 
extract the nucleic acids contained therein. The 
resulting nucleic acid from the sample may be subjected 
to gel electrophoresis or other size separation 
techniques; alternatively, the nucleic acid sample may be 
dot blotted without size separation. In order to form 
hybrid duplexes with the targeting sequence of the probe, 
the targeted region of the analyte nucleic acid must be 
in single- stranded form. The latter may occur naturally; 
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alternatively, it may be accomplished by denaturation. 
Denaturation can be accomplished by various techniques 
known in the art. Subsequent to denaturation, the 
analyte nucleic acid and probe are incubated under 
conditions which promote stable hybrid formation of the 
target sequence in the probe with the putative targeted 
sequence in the analyte, and the resulting duplexes 
containing probe (s) are detected. 

Detection of the resulting duplex, if any, is 
usually accomplished by the use of labeled probes; 
alternatively, the probe may be unlabeled, but may be 
detectable by specific binding with a ligand which is 
labeled, either directly or indirectly. Suitable labels, 
and methods for labeling probes and ligands are known in 
the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick 
translation or kinasing) , biotin, fluorescent groups, 
chemiluminescent groups (e.g., doxetanes, particularly 
triggered dioxetands) , enzymes, antibodies, and the like. 
Variations of this basic scheme are known in the art. 

If the targeted pile sequences are expected to 
be present at relatively low levels, amplification may be 
required for their detection. Such techniques are known 
in the art. For example, the Enzo Biochemical 
Corporation "Bio -Bridge" system uses terminal 
deoxynucleotide transferase to add unmodified 
3' -poly- dT- tails to a DNA probe. The poly dT- tailed 
probe is hybridized to the target nucleotide sequence, 
and then to a biotin-modif ied poly-A. PCT application 
84/03520 and EPA124221 describe a DNA hybridization 
assay in which: (1) analyte is annealed to a 
single- stranded DNA probe that is complementary to an 
enzyme- labeled oligonucleotide; and (2) the resulting 
tailed duplex is hybridized to an enzyme -labeled 
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oligonucleotide. EPA 204510 describes a DNA 
hybridization assay in which analyte DNA is contacted 
with a probe that has a tail, such as a poly-dT tail, an 
amplif ier strand that has a sequence that hybridizes to 
5 the tail of the probe, such as a poly- A sequence, and 
which is capable of binding a plurality of labeled 
strands. A particularly desirable technique may first 
involve amplification of the target pilC sequences. The 
target pilC sequences in sera may be amplified, for 
10 example, to approximately 10 6 sequences/ml. This may be 
accomplished, for example, by the polymerase chain 
reactions (PCR) technique described by Saiki et al. 
(1986), by Mallis, U.S. Patent No. 4,683,195, and by 
Mullis et al. U.S. Patent No. 4,683,202. Amplification 
15 may be prior to, or preferably subsequent to purification 
of the pilC target sequence. For example, amplification 
may be utilized in conjunction with the assay methods 
described in U.S. Patent No. 4,868,105, or if even 
further amplification is desired, in conjunction with the 
20 hybridization system in EPO Publication No. 317,077. 

Generally, in the PCR technique, short 
oligonucleotide primers are prepared which match opposite 
ends of a desired sequence. The sequence between the 
primers need not be known. A sample of polynucleotide is 
25 extracted and denatured, preferably by heat, and 

hybridized with oligomers which are oligonucleotide 
primers, which are present in molar excess. 
Polymerization is catalyzed by a template- and primer- 
dependent polymerase in the presence of deoxynucleotide 
triphosphates (dNTPs) , and may also be in the presence of 
nucleotide analogs. This results in two "long products" 
which contain the respective primers at their 5' -termini, 
covalently linked to the newly synthesized complements of 
the original strands. The replicated DNA is again 
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denatured, hybridized with oligonucleotide primers, 
returned to polymerizing conditions, and a second cycle 
of replication is initiated. The second cycle provides 
the two original strands, the two long products from 
5 cycle 1, and two "short products" replicated from the 
long products. The short products contain sequences 
(sense or antisense) derived from the target sequence, 
flanked at the 5 # - and 3 '-termini with primer sequences. 
On each additional cycle, the number of short products is 

10 replicated exponentially. Thus, this process causes 
amplification of a specific target sequence. 

It will be understood that "primer", as used 
herein, may refer to more than one primer, particularly 
in the case where there is some ambiguity in the 

15 information regarding the terminal .sequence (s) of the 
target region to be amplified. Hence, a "primer" 
includes a collection of primer oligonucleotides 
containing sequences representing the possible variations 
in the sequence or includes nucleotides which allow a 

20 typical base pairing. One of the primer oligomers in 
this collection will be homologous with the end of the 
target sequence. 

The amplified sequence (s) may then be detected 
using a hybridization assay which utilize nucleic acid 

25 ntul timers which bind to single- stranded analyte nucleic 
acid, and which also bind to a multiplicity of 
single- stranded labeled oligonucleotides. A suitable 
solution phase sandwich assay which may be used with 
labeled polynucleotide probes, and the methods for the 

30 preparation of probes is described in EPO 225,807, 
published June 16, 1987. 

The probes can be packaged into diagnostic 
kits. Diagnostic kits include the probe DNA, which may 
be labeled; alternatively, the probe DNA may be unlabeled 
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and the ingredients for labeling may be included in the 
kit in separate containers. The kit may also contain 
other suitably packaged reagents and materials needed for 
the particular hybridization protocol, for example, 
5 standards, buffers, as well as instructions for 
conducting the test using the kit ingredients. 

The pilC DNA sequence information in the clones 
described in the Examples may be used to gain further 
information on the remaining sequence of the pilC gene 

10 from meningococci, for other possible alleles of pilC in 
Neisseria, as well as pile in other relevant genuses and 
species. This information will aid in the 
characterization of the gene, and of its role in 
virulence of the pathogenic forms of microorganisms, 

15 including, for example, Neisseria, Pseudamonas, 

Bacteroides, Moraxella, and Vibrio. Moreover, this 
sequence information can lead to additional 
polynucleotide probes, polypeptides derived from pilC, 
multiple pilC loci, and antibodies directed against PilC 

20 epitopes which would be useful for the diagnosis and/or 
treatment of infections caused by the relevant pathogenic 
microorganisms . 

The DNA sequence information in the 
above-mentioned clones is useful for the design of probes 

25 for the isolation of additional DNA sequences which are 
derived from as yet undefined regions of pilC. For 
example, labeled probes containing a sequence of 
approximately 8 or more nucleotides, and preferably 20 or 
more nucleotides, which are derived from regions close to 

30 the ends of the DNA sequences shown in the Examples. 
These probes may be used to isolate overlapping DNA 
sequences within or adjacent to pilC from DNA libraries 
created from genomes of species having type 4 pilins. 
The resulting overlapping DNAs may then be used to 
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synthesize probes for identification of other overlapping 
fragments which do not necessarily overlap the DNAs whose 
sequences are given in the Examples. Thus, it is 
possible to sequence entire pllC genes utilizing the DNA 
5 sequences provided herein and the technique of isolation 
of overlapping DNAs derived from the pile genes. 

Methods for constructing DNA libraries are 
known in the art, and are discussed infra; for example, a 
method for the construction of pllC libraries in 

10 lambda-gtll is discussed infra in Section IV. A. However, 
DNA libraries which are useful for screening with nucleic 
acid probes may also be constructed in other vectors 
known in the art, for example, lambda-gtlO (Huynh et al. 
(1985)). Another suitable vector for the creation of 

15 libraries may be EMBL3 , which is a replacement vector 
which accepts inserts ranging from 9 to 23 kb in size. 
In general, methods for constructing DNA libraries is 
discussed in Maniatis et al, MOLECULAR CLONING, 2nd 
edition, (1989). 

20 The sequence information derived from these 

overlapping pllC DNAs is useful for determining areas of 
homology and heterogeneity within the pllC gene(s), which 
could indicate the presence of different strains 
gonococci, meningococci, or other hitherto unrecognized 

25 pathogenic forms of Neisseria. It is also useful for the 
design of hybridization probes to detect PilC antigens or 
pile nucleic acids in biological samples. Moreover, the 
overlapping DNAs may be used to create expression vectors 
for polypeptides derived from pile gene(s) . 

30 The pile DNA sequence information may also 

allow the construction of additional bacteriostatic 
agents for treatment of neisserial infections, in that 
they may block the expression of PilC and/or pilin 
assembly. For example, it may be used to derive 
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antisense polynucleotides. Antisense polynucleotides 
molecules are comprised of a complementary nucleotide 
sequence which allows them to hybridize specifically to 
designated regions of genomes or RNAs. Antisense 
5 polynucleotides may include, for example, molecules that 
will block protein translation by binding to mRNA, or may 
be molecules which prevent replication of DNA by 
replicase. They may also include molecules which carry 
agents (non-covalently attached or covalently bound) 

10 which cause the mRNA or genomic DNA to be inactive by 
causing, for example, scissions in these molecules. 
Antisense molecules which are to hybridize to pilC 
derived polynucleotides may be designed based upon the 
sequence information of the pilC DNA sequences provided 

15 herein, including those which would be isolated from 

additional DNA libraries. The antibacterial agents based 
upon anti- sense polynucleotides for pilC may be designed 
to bind with high specificity, to be of increased 
solubility, to be stable, and to have low toxicity. 

20 Hence, they may be delivered in specialized systems, for 
example, liposomes, or by gene therapy. In addition, 
they may include analogs, attached proteins, substituted 
or altered bonding between bases, etc. 

Other types of drugs may be based upon 

25 polynucleotides which "mimic" important control regions 
of the pilC gene, and which may be therapeutic due to 
their interactions with key components of the system 
responsible for expression of the gene. 

In addition to the specific methods described 

30 in the Examples, general methods are known which may be 
used in the practice of the invention. For example, 
general techniques used in extracting the genome from 
bacteria, including Neisseria, preparing and probing a 
DNA library, sequencing clones, constructing expression 
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vectors, transforming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for 
rowing cells in culture, and the like are known in the 
rt and laboratory manuals are available describing these 
5 techniques. However, as a general guide, the following 
sets forth some sources currently available for such 
procedures, and for materials useful in carrying them 
out. 

Both prokaryotic and eukaryotic host cells may 

10 be used for expression of desired coding sequences when 
appropriate control sequences which are compatible with 
the designated host are used. Among prokaryotic hosts, 
E. coli is most frequently used. Expression control 
sequences for prokaryotes include promoters, optionally 

15 containing operator portions, and ribosome binding sites. 
Transfer vectors compatible with prokaryotic hosts are 
commonly derived from, for example, pBR322, a plasmid 
containing operons conferring ampicillin and tetracycline 
resistance, and the various pUC vectors, which also 

20 contain sequences conferring antibiotic resistance mark- 
ers. These markers may be used to obtain successful 
transformants by selection. Commonly used prokaryotic 
control sequences include the Beta- lactamase 
(penicillinase) and lactose promoter systems (Chang et 

25 al. (1977)), the tryptophan (trp) promoter system 
(Goeddel et al. (1980)*) and the lambda -derived P L 
promoter and N gene ribosome binding site (Shimatake et 
al. (1981)) and the hybrid tac promoter (De Boer et al. 
(1983)) derived from sequences of the trp and lac UV5 

30 promoters. The foregoing systems are particularly 

compatible with E. coli; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 

35 



WOW/13871 



PCT/US92/00863 



-54- 

Eukaryotic hosts include yeast and mammalian 
cells in culture systems. Saccharomyces cerevisiae and 
Saccnaromyces carlsbergensis are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
5 compatible vectors carry markers which permit selection 
of successful transformants by conferring prototrophy to 
auxotrophic mutants or resistance to heavy metals on 
wild- type strains. Yeast compatible vectors may employ 
the 2 micron origin of replication (Broach et al. 

10 (1983)), the combination of CEN3 and ARS1 or other means 
for assuring replication, such as sequences which will 
result in incorporation of an appropriate fragment into 
the host cell genome. Control sequences for yeast 
vectors are known in the art and include promoters for 

15 the synthesis of glycolytic enzymes (Hess et al. (1968) ; 
Holland et al. (1978)), including the promoter for 3 
phosphoglycerate kinase (Hitzeman (1980) ) . Terminators 
may also be included, such as those derived from the 
enolase gene (Holland (1981) ) . Particularly useful 

20 control systems are those which comprise the 

glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter 
or alcohol dehydrogenase (ADH) regulatable promoter, 
terminators also derived from GAPDH, and if secretion is 
desired, leader sequence from yeast alpha factor. In 

25 addition, the transcriptional regulatory region and the 
transcriptional initiation region which are operably 
linked may be such that they are not naturally associated 
in the wild- type organism. These systems are described 
in detail in EPO 120,551, published October 3, 1984; EPO 

30 116,201, published August 22, 1984; and EPO 164,556, 
published December 18, 1985. 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
immortalized cell lines available from the American Type 
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Culture Collection (ATCC) , including HeLa cells, Chinese 
hamster ovary (CHO) cells, baby hamster kidney (BHK) 
cells, and a number of other cell lines. Suitable 
promoters for mammalian cells are also known in the art 
5 and include viral promoters such as that from Simian 

Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV) , 
adenovirus (ADV) , and bovine papilloma virus (BPV) . 
Mammalian cells may also require terminator sequences and 
poly A addition sequences; enhancer sequences which 

10 increase expression may also be included, and sequences 
which cause amplification of the gene may also be 
desirable. These sequences are known in the art* 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 

15 replicons, or sequences which insure integration of the 
appropriate sequences encoding PilC epitopes into the 
host genome. 

A vector which is used to express foreign DNA, 
and which may be used in vaccine preparation is Vaccinia 

20 virus. In this case the heterologous DNA is inserted 
into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art, and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 

25 generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tk) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described (see, for example, Mackett et 

30 al. (1984), Chakrabarti et al. (1985); Moss (1987)). 
Expression of the polypeptide containing at least one 
immunoreactive PilC epitope then occurs in cells or 
individuals which are immunized with the live recombinant 
vaccinia virus. 
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Other systems for expression of desired 
polypeptides include insect cells and vectors suitable 
for use in these cells. These systems are known in the 
art, and include, for example, insect expression transfer 
5 vectors derived from the baculovirus Autographs. 

calif oxnica. nuclear polyhedrosis virus (AcNFV) , which is 
a helper- independent, viral expression vector. 
Expression vectors derived from this system usually use 
the strong viral polyhedrin gene promoter to drive 

10 expression of heterologous genes. Currently the most 
commonly used transfer vector for introducing foreign 
genes into AcNPV is pAc373. Many other vectors, known to 
those of skill in the art, have also been designed for 
improved expression. These include, for example, pVL985 

15 (which alters the polyhedrin start codon from ATS to ATT, 
and which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Summers (1989)) . 

Methods for the introduction of heterologous 
DNA into the desired site in the baculovirus virus are 

20 known in the art. (See Summer and Smith, Texas 

Agricultural Experiment Station Bulletin No. 1555; Ju et 
al. (1987) ; Smith et al. (1983) ; and Luckow and Summers 
(1989)) . For example, the insertion can be into a gene 
such as the polyhedrin gene, by homologous recombination; 

25 insertion can also be into a restriction enzyme site 
engineered into the desired baculovirus gene. The 
inserted sequences may be those which encode all or 
varying segments of the polyprotein, or other orfs which 
encode viral polypeptides. . For example, the insert could 

30 encode the following numbers of amino acid segments from 
the polyprotein: amino acids 1-1078; amino acids 332-662; 
amino acids 406-662; amino acids 156-328, and amino acids 
199-328. 



WO 92/13871 



PCT/US92/00863 



-57- 

The signals for posttranslational 

modifications, such as signal peptide cleavage, 

proteolytic cleavage, and phosphorylation, appear to be 

recognized by insect cells. The signals required for 

5 secretion and nuclear accumulation also appear to be 

conserved between the invertebrate cells and vertebrate 

cells. Examples of the signal sequences from vertebrate 

cells which are effective in invertebrate cells are known 

in the art, for example, the human interleukin 2 signal 

10 (IL2 e ) which is a signal for transport out of the cell, 
s 

is recognized and properly removed in insect cells. 

Recombinant polynucleotides are inserted into 
host cells by transformation. Transformation may be by 
any known method for introducing polynucleotides into a 

15 host cell, including, for example packaging the 

polynucleotide in a virus and transducing a host cell 
with the virus, and by direct uptake of the 
polynucleotide. The transformation procedure used 
depends upon the host to be transformed. Bacterial 

20 transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen 
(1972); Maniatis (1989)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et 
al. (1978). Mammalian transformations by direct uptake 

25 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978) , or 
the various known modifications thereof. Other methods 
for the introduction of recombinant polynucleotides into 
cells, particularly into mammalian cells, which are known 

30 in the art include dextran-mediated transfection, calcium 
phosphate mediated transfection, polybrene mediated 
transfection, protoplast fusion, electroporation, 
encapsulation of the polynucleotide (s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. 
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The recombinant polynucleotide may be in the 
form of a vector. Vector construction employs techniques 
which are known in the art. Site-specific DNA cleavage 
is performed by treating with suitable restriction 
5 enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. 
In general, about 1 microgram of plasmid or DNA sequence 
is cleaved by 1 unit of enzyme in about 20 microliters 
buffer solution by incubation of 1-2 hr at 37° C. After 

10 incubation with the restriction enzyme, protein is 
removed by phenol/chloroform extraction and the DNA 
recovered by precipitation with ethanol. The cleaved 
fragments may be separated using polyacrylamide or 
agarose gel electrophoresis techniques, according to the 

15 general procedures found in Methods in Enzymology (1980) 
££:499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coli DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates 

20 (dNTPs) present in the mixture. Treatment with SI 

nuclease may also be used, resulting in the hydrolysis of 
any single stranded DNA portions. 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase and ATP; 

25 sticky end ligations require less ATP and less ligase 

than blunt end ligations. When vector fragments are used 
as part of a ligation mixture, the vector fragment is 
often treated with bacterial alkaline phosphatase (BAP) 
or calf intestinal alkaline phosphatase to remove the 

30 5 '-phosphate and thus prevent religation of the vector; 
alternatively, restriction enzyme digestion of unwanted 
fragments can be used to prevent ligation. 

Ligation mixtures are transformed into suitable 
cloning hosts, such as E. coli, and successful 
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transformants selected by, for example, antibiotic 
resistance, and screened for the correct construction. 
The desired recombinant DNA sequences may be 
synthesized by synthetic methods. Synthetic 
5 oligonucleotides may be prepared using an automated 
oligonucleotide synthesizer as described by Warner 
(1984) . If desired the synthetic strands may be labeled 
with 32 P by treatment with polynucleotide kinase in the 
presence of 32 P-ATP, using standard conditions for the 
10 reaction. 

DNA sequences, including those isolated from 
DNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as 
described by Zoller (1982). Briefly, the DNA to be 

15 modified is packaged into phage as a single stranded 

sequence, and converted to a double stranded DNA with DNA 
polymerase using, as a primer, a synthetic 
oligonucleotide complementary to the portion of the DNA 
to be modified, and having the desired modification 

20 included in its own sequence. The resulting double 

stranded DNA is transformed into a phage supporting host 
bacterium. Cultures of the transformed bacteria, which 
contain replications of each strand of the phage, are 
plated in agar to obtain plaques. Theoretically, 50% of 

25 the new plaques contain phage having the mutated 
sequence, and the remaining 50% have the original 
sequence. Replicates of the plaques are hybridized to 
labeled synthetic probe at temperatures and conditions 
which permit hybridization with the correct strand, but 

30 not with the unmodified sequence. The sequences which 
have been identified by hybridization are recovered and 
cloned. 

DNA libraries may be probed using the procedure 
of Grunstein and Hogness (1975) . Briefly, in this 
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procedure, the DNA to be probed is immobilized on nitro- 
cellulose filters, denatured, and prehybridized with a 
buffer containing 0-50% formamide, 0.75 M NaCl, 75 mM Na 
citrate, 0.02% (wt/v) each of bovine serum albumin, poly- 
5 vinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 
6.5), 0.1% SDS, and 100 micrograms /ml carrier denatured 
DNA. The percentage of formamide in the buffer, as well 
as the time and temperature conditions of the 
prehybridization and subsequent hybridization steps 

10 depends pn the stringency required. Oligomeric probes 
which require lower stringency conditions are generally 
used with low percentages of formamide, lower 
temperatures, and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 

15 derived from cDNA or genomic sequences generally employ 
higher temperatures, e.g., about 40-42°C, and a high 
percentage, e.g., 50%, formamide. Following 
prehybridization, 5' - 32 P-labeled oligonucleotide probe is 
added to the buffer, and the filters are incubated in 

20 this mixture under hybridization conditions. After 
washing, the treated filters are subjected to 
autoradiography to show the location of the hybridized 
probe; DNA in corresponding locations on the original 
agar plates is used as the source of the desired DNA. 

25 For routine vector constructions, ligation 

mixtures are transformed into E. coli strain HB101 or 
other suitable host, and successful transformants 
selected by antibiotic resistance or other markers. 
Plasmids from the transformants are then prepared 

30 according to the method of Clewell et al. (1969), usually 
following chloramphenicol amplification (Clewell (1972)). 
The DNA is isolated and analyzed, usually by restriction 
enzyme analysis and/or sequencing. Sequencing may be by 
the dideoxy method of Sanger et al. (1977) as further 
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described by Messing et al. (1981), or by the method of 
Maxam et al. (1980) . Problems with band compression, 
which are sometimes observed in GC rich regions, were 
overcome by use of T-deazoguanosine according to Barr et 
5 al. (1986). 

An enzyme- linked immunosorbent assay (ELISA) 
can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of 
an enzyme to either an antigen or an antibody, and uses 

10 the bound enzyme activity as a quantitative label. To 
measure antibody, the known antigen is fixed to a solid 
phase (e.g., a microplate or plastic cup), incubated with 
test serum dilutions, washed, incubated with 
anti- immunoglobulin labeled with an enzyme, and washed 

15 again. Enzymes suitable for labeling are known in the 
art, and include, for example, horseradish peroxidase. 
Enzyme activity bound to the solid phase is measured by 
adding the specific substrate, and determining product 
formation or substrate utilization colorimetrically. The 

20 enzyme activity bound is a direct function of the amount 
of antibody bound. 

To measure antigen, a known specific antibody 
is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is 

25 washed, and a second enzyme -labeled antibody is added. 

After washing, substrate is added, and enzyme activity is 
estimated colorimetrically, and related to antigen 
concentration . 

Example?? 

30 Described below are examples of the present 

invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
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embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill in the art. 

Example 1 

5 Jpnlafinn of PilC 

An outer membrane preparation from 
N. gonorrhoeae strain MS 11^ (P + ) contains small amounts 
of a 110 kd protein, PilC. This protein was enriched 
during alternate cycles of crystallization and 

10 solubilization of pili, unlike other outer membrane 

proteins that decreased in abundance by this procedure. 

The materials and methods used for the 
isolation procedure were the following. 
B ? ri-grial stra5 ng and growth conditions 

15 N. gonorrhoeae MSll^ (Meyer et al., 1984) and 

P' and P"n variants of MSll^ (Swanson et al., 1986) were 
kindly obtained from Dr. M. So and from Dr. M. Koomey, 
respectively. The gonococcal isolates UM01 and KH4318 
have previously been described (Norlander et al., 1981). 

20 N. gonorrhoeae strains 605344 and 605103 were obtained 
from Dr. D. Danielsson, Orebro, Sweden, and strain 765 
was isolated at the Department of Bacteriology in Umea, 
Sweden. The commensal Jfeisseria species N. lactamica 
Nctc 10618 and N. subflava GN01 were obtained from 

25 Pharmacia, Uppsala, Sweden. These bacteria were grown at 
37°C in a 5% C0 2 atmosphere on Difco GCB agar containing 
Kellogg' s supplement. Piliated (P + ) and nonpiliated (P") 
variants were distinguished by colony morphology and 
passed as single colonies. E. coli strain Y 1090 

.30 (obtained from Promega Biotech) was used for plaque 
screening, DH5 (Hanahan, 1985) for molecular cloning, 
AA10 recA (Stoker et al. 1984) for isolation of minicells 
and TGI (Gill et al., 1986) for propagation of M13 
clones . 

35 
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Preoaration of pili and outer membranes 

Pili were prepared essentially as described by 
Brinton et al. (1978). Gonococci (P + Tr) from 80 GGB 
5 plates, grown for 18 h, were harvested in 0.05 M Tris- 
HC1 pH 8,0 and 0.15 M NaCl, washed twice and resuspended 
in 40 ml 0.15 M ethanolamine pH 10.5. Pili were sheared 
off in a Sorvall Omnimixer, setting 3 for 30 s. The cell 
debris was pelleted at 13,000 g for 30 min at 4°C and the 

10 supernatant was dialyzed against 0.05 M Tris-HCl pH 8.0 
and 0.15 M NaCl. The crystallized pili were pelleted at 
13,000 g for 60 min., resuspended in 0.15 M ethanolamine 
pH 10.5, and centrifuged at 23,000 g for 60 min. The 
supernatant was dialyzed as described above against 0.05 

15 M Tris-HCl pH 8.0 and 0.15 M NaCl. Several cycles of 
crystallization and solubilization were performed to 
produce pili preparations with high purity. Outer 
membranes of tf. gonorrhoeae were prepared by the sarkosyl 
method described by Norquist et al. (1978) . 

20 

Example 2 

Preparation Qt S^irifiefl Anti-PilC Antifroflies 

The 110 kd protein present in purified MSll^ 
pili preparations was eluted from SDS polyacrylamide gels 
25 and rabbit antibodies were generated against the' gel 
purified protein. The antiserum cross reacted 
extensively with the pilin protein in immunoblots and was 
therefore absorbed with extracts of Pseudomonas putida 
expressing the pilin subunit of N. gonorrhoeae on plasmid 
30 pGC02. 

Pili preparations of N. gonorrhoeae MSll ma (P + ) 
crystallized 5 times were separated on 10% SDS -poly- 
acrylamide gels using the buffer system of Laemmli 
(1970) . These gels were stained in 0.25 M KC1 and 1 iriM 
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DTT for 5 min. , the 110 kd protein band was sliced out, 
crushed and incubated in a buffer containing 0.05 M Tris- 
HC1 pH 7.9, 0.1 mM EDTA, 5 mM DTT and 0.15 M NaCl at 4°C 
overnight. Gel pieces were removed by centrifugation 
5 prior to immunization of rabbits. 

The achieved 110 kd- antiserum was extensively 
absorbed with Pseudomonas putida 2440 (Bagdasarien et al. 
(1983), carrying a recombinant plasmid, pGC02, 
constructed as follows. The 1.0 kb flpal-EcoRI fragment 

10 of the pilus gene clone pNGHOO (Meyer et al. 1984) 

obtained from M. So was cloned into the ffpal and EcdRl 
sites of pMMB66 (Furste et al., 1986). The pilE gene is 
then under control of the tac promoter and induction with 
1 mM IPTO resulted in high levels of pilin produced in P. 

15 putida 2440, but no extracellular pili structures were 
observed. Dense sonicated cultures of P. putida 
2440/pGC02 were mixed in a 1:1 ratio with the crude 
antiserum. About 15 cycles of 1 h incubation and 30 min 
centrifugation at 25,000 g in the presence of 1 mM PMSF 

20 (phenylmethylsulfonylfluoride) at 4°C were performed. 

The pili antiserum used in immunoblots was 
generated in a rabbit against highly purified pili 
preparations of N. gonorrhoeae MSll^. 

In immunoblots 10 fig of boiled bacterial cells 

25 or the same amount of outer membranes were 

electrophoresed on 10% SDS-polyacrylamide gels. The 
proteins were transferred from the gel onto 
nitrocellulose sheets where their immunological cross - 
reaction with the 110 kd absorbed antiserum was tested 

30 using an immunoblotting protocol as described by Towbin 
et al. (1979) . 
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Tgyafy ple 3 

Specificity of Purified Anti-PilC Antibodies 

The absorbed antiserum was used in immunoblots 
with whole cell extracts of a number of N. gonorrhoeae 
5 strains as well as commensal strains of Neisseria (Figure 
2A) . All strains of N. gonorrhoeae, except strain 
605103, contained one or two high molecular weight 
protein species reacting with the antiserum. Strain 
605103, unlike the other strains tested, was nonpiliated 

10 and no piliated variants could be obtained suggesting 
that it is a P"n variant (Swanson et al., 1985). This 
was confirmed by Southern blot hybridization using an 
oligonucleotide probe corresponding to the 5' end of the 
pilE gene. No hybridization was obtained with this 

15 probe. The commensal N. lac t arnica Nctcl0618, but not N. 
subflava GN01, contained a high molecular weight protein 
reacting with the 110 kd antiserum, tnamunoblots against 
outer membrane preparations of P + and P~n MSll^ showed 
the 110 kd protein to be present in the outer membrane in 

20 both of these MS 11 variants. 

Southern blot hybridization was accomplished as 
follows. Digested genomic DNA was separated on 0.7% 
agarose gels and transferred to nitrocellulose filters 
(Southern, 1975). After transfer and baking the filters 

25 were prehybridized in a mixture of 5 x SSC, 0.1% SDS, 
5 mM EDTA, 5 x Denhardt's solution and 100 fig/vcOL of 
sonicated calf thymus DNA at 65°C for 2-6 h. 32 P-labeled 
probe (multiprime DNA labelling system, Amersham 
International) was added and hybridization was performed 

30 for 12-15 h at the same temperature. The filters were 
washed in 2 x SSC with 0.1% SDS and in 0.2 x SSC with 
0.1% SDS for 2 x 15 min each, dried and exposed to Kodak 
XRP film at -80°C. 
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A 21 -base -long oligonucleotide complementary to 
the signal peptide coding region of pilE (5'- 
C^CTTTTTGAAGGGTATTCAT-3 ' ) was 32 P-labeled with T4 
polynucleotide kinase and used to probe Clal-digested 
5 genomic DNA. The blot was prehybridized at 37°C in a 

mixture containing 2 x Denhardt's, 0.1% SDS, 2.5 mM EDTA, 
5 x SSC and 100 !Lg/vd- sonicated calf thymus DNA, 
hybridized at 37°C and washed in 2 x SSC for 5 min. 
MSll^tP+j gave a 4 kb hybridization fragment, whereas 
10 MSll mk (P"n) and 605103 gave no hybridization signal. 

fl/mtgmlar clo ning of the vilCl gene 
ancodinq a 110 kd protein 

15 Chromosomal DNA from N. gonorrhoeae MSll^fP ) 

was used to construct a Xgtll library. The library was 
screened with the absorbed 110 kd antiserum and one 
positive clone out of 10,000 plaques was found, 
containing an 800 bp insert. A lysogen of this positive 

20 Xgtll clone was examined in immunoblots and a fusion 

protein with an estimated size of 150 kd reacted with the 
antiserum (data not shown) . The 800 bp insert was 
purified, labeled with 32 P, and used as a probe to screen 
a plasmid library from N. gonorrhoeae MSll^. Six clones 

25 out of 10,000 hybridized with the probe. Restriction 

maps for these partially overlapping six clones are shown 

in Figure 1. 

In Figure 1, plasmids pABJ04-09, which all 
belong to locus 1, were isolated from a plasmid library 
30 using the 800 bp insert from Xgtll as a probe. The Xgtll 
insert (from locus 2) has an additional Sail site not 
found in the plasmid clones. The position of the pilCl 
gene and direction of its transcription (indicated. by an 
arrow) were determined in E. coli minicells. Three 
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thickened lines, with an arrow-head at each end, indicate 

fragments used as probes in Southern hybridizations. 

I.e., the 800 bp insert from Xgtll, the EcoRV 1 -BcoRV 2 

(1.3kb) and the EcoRV 3 -HfndIII 4 (0.8 kb) fragments of 

5 pAGJ04. Triangles mark the location of two mTnCm 

insertions in pABJ04. The resulting plasmids, 

pABJ)4: :mTnCm-12 and pABJ04: :mTnCm-14 were used, were 

used to inactivate pilCl and pilC2. 

The six plasmid clones, pABJ04-09, were 

10 transformed into the minicell producing strain AA10 to 

35 

monitor expression of plasmid encoded [ S] methionine 
labeled proteins. The E. coll minicell strain AA10 was 
transformed with plasmid DNA (pABJ04-09) and chromosome 
deficient minicells from these strains were purified over 

15 sucrose gradients (Thompson and Achtman, 1978) . The 

plasmid- encoded proteins were labeled in the presence of 
80 fid [ 35 S] methionine in minimal salts medium and 1% 
methionine assay medium (Dif co) . After lysis of the 
minicells in sample buffer (Laemmli, 1970) the proteins 

20 were electrophoresed on an SDS-polyacrylaitiide gel, the 

gel was dried and exposed to X-ray film (Kodak X-OmatAR) . 

Plasmid pABJ04 expressed minute amounts of 
three high molecular weight proteins, 113, 111 and 108 kd 
in size, as well as a number of lower molecular weight 

25 protein species not produced from the vector control. 
The three high molecular weight bands were missing in 
pABJOS and pABJ06 but three novel lower molecular weight 
protein species had appeared, suggesting that pABJOB and 
pABJ06 are deleted for the .3' end of a gene, denoted 

30 pilCl, and that this gene is responsible for all three 
high molecular weight species. This suggested that the 
distal end of the gene must be located between the Mlul 1 
and Mull 2 sites (Figure 1) . The observation that plasmid 
pABJ07 did not express any high molecular proteins 
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tentatively located the 5' end of the gene to a region 
0.5-1.2 kb to the right of the EcoRV 3 site. The size for 
a gene encoding a 110 kd protein is ~3 kb which is in 
agreement with these mapping data. 

5 

Example 5 

Identification of a Second Gene Enc oding PilC 
The 800 bp insert in Xgtll contains a single 
Sail site not present in the region on pABJ04 which 

10 hybridized to this fragment, suggesting that there is 

more than one pilC locus in the genome of N. gonorrhoeae 
MSll. This was confirmed in Southern blot hybridizations 
in which three different pilC fragments were used to 
probe final and Clal digested genomic DNA. The 800 bp 

15 fragment from Xgtll hybridized in a Southern blot to two 
Clal (18 and 8 kb) and fimal (13 and 4.5 kb) fragments of 
DNA prepared from N, gonorrhoeae MSll^. Since the probe 
does not contain any internal Clal or final sites, there 
are presumably two copies of the 3' end of pilC in the 

20 MSll genome. The 1.3 kb EcdR^ - EcoRV 2 fragment of 

pABJ04 carries the central region of pilCl. This probe 
hybridized to the same two Clal fragments and to four 
Steal fragments, two of which are the same size as the two 
fimal fragments identified with the 800 bp probe (13 kb 

25 and 4.5 kb) . Hybridization with the 800 bp probe was 
more extensive to the 8 kb Clal and the 4 kb fimal 
fragment whereas the reverse was found with the 1.3 kb 
■EcoR^ - BcoRV 2 fragment from pABJ04 strongly suggesting 
that the two genomic copies of pilC show a significant 

30 sequence variation in the 3' as well as in the central 
region. A probe corresponding to the 5' region of pilCl 
was also used in Southern hybridization experiments. 
This 0.8 kb HindIII 4 - EcoRV 3 fragment hybridized to two 
Clal (18 kb and 4 kb) and fimal (25 kb and 7 kb) fragments 
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with seemingly equal efficiency. The hybridization 
pattern was identical using DNA from MSII^. Taken 
together these hybridization data indicate that 
N. gonorrhoeae MS11 contains two complete copies of pilC. 
5 Furthermore the two genes appear to be more homologous in 
their 5' as compared to their central and 3' regions. 

The results indicate that the 800 bp insert 
from Xgtll carries information from pilC2 whereas the 
clones pABJ04-09 must carry information from pilCl. 

10 Finally pilC2 must be located >2 kb from either end of 
pilCl. The DNA sequence of the 3' -end of the pilC2 
fragment is shown in Figure 7. The sequence showing the 
putative amino acids encoded therein are shown in Figure 
8. A comparison of the analogous portions of pilC2 (top) 

15 . and pilCl (bottom) DNA sequences, and the putative amino 
acids encoded therein are shown in Figure 9. 

The 800 bp fragment from p±lC2 was also used to 
probe digested genomic DNA from N. gonorrhoeae strains 
UM01, 765 and 605103. The latter isolate does not 

20 express detectable levels of the 110 kd protein. Strain 
UM01, unlike MSll, contained only one Clal fragment of 15 
kb that hybridized to the probe (data not shown) . Hence, 
this strain may contain only one copy of pilC. Strain 
605103 and 765, on the other hand, each seem to contain 

25 two copies of pilC since two Clal and two Sraal fragments 
hybridized to the 800 bp probe. 

The commensal N. lactamica Nctcl0618 DNA 
digested with Clal and Sural also hybridized with the 800 
bp probe. Since only one band hybridized in each case 

30 this strain may contain only one copy of pilC. In 

contrast, N. subflava GN01 did not hybridize to the 800 
bp pilC2 probe using the same stringency. 
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T7.yarnp le 6 
rha-racfceriz afion of pUC Genes 

Th* p-MCl aene on nABJ04 i « t-ranalafi onallv out Of f^ame 
5 The amino terminal sequence of gel purified 110 

kd protein from strain MSll^g (P + ) was determined by 
sequential Edman degradation. For aminoterminal sequence 
determination automated Edman degradations (Edman and 
Bregg, 1967) were performed in an updated Beckman 890C 

10 spinning cup sequencing sequencer. The sequencing 

procedure and the method for analysis of the 3 -phenyl -2- 
thiohydantoin derivatives been described (Engstrom et 
al., 1984). Considerable difficulties were encountered 
in the method probably due to blocking of the N- terminus. 

15 As a result, only the residues from position 4 to 10 were 
obtained (Figure 2). 

The 3.3 kb HindIII 4 - Mlu^ fragment 
encompassing the entire pilCl gene was sequenced on both 
strands using the dideoxy sequencing method adapted for 

20 single stranded DNA. 

Purified DNA fragments from pABJ04 and PCR- 
amplified 5' end of pilCl was subcloned into M13 vectors 
(Sanger et al., 1980: Yanish-Perroa et al., 1985) and 
sequenced using the chain termination method of Sanger et 

25 al. (1977) . Primers used were the M13 17-mer universal 
primer and oligonucleotides synthesized at Symbicon, 
Dmei, Sweden or at the Department of Biochemistry, 
Washington University, St., Louis, MO, USA. 

The results of the sequencing showed that the 

30 pilCl contained one single open reading frame of 997 

codons (from left to right in Figure 2) and starting at 
an AUG codon 195 bp from the HindIII 4 site. Codons 7-12 
in this open reading frame corresponded to amino acids 5- 
10 in the sequence of the gel purified protein. The AUG 
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codon in the beginning of the long open reading frame was 
not preceded by a typical Shine -Dalgarno sequence. 
Moreover, since the 110 kd protein is located in the 
outer membrane of N. gonorrhoeae, we expected the protein 
5 to be translated with a signal sequence. When examining 
the nucleotide sequence, an AUG codon was found in frame 
1 that was preceded by a typical Shine -Dalgarno sequence 
( - AGGAA- ) . The sequence following this AUG codon would 
encode a typical signal peptide with basic amino acids in 

10 the amino terminal region and a hydrophobic central 

region. However, no signal peptidase cleavage site could 
be predicted following the rules of von Heijne (1983) . A 
tract of 12G residues was found in the region encoding 
the putative signal peptide for PilC. Addition of one G 

15 residue or the loss of two would align the long open 
reading frame with the AUG codon in frame 1. The 
translated region in frame 2 contains a putative signal 
peptidase cleavage site between Ala and Gin. A cleavage 
at this site would align the determined amino acid 

20 sequence at positions 5-10 for the 110 kd protein with 
the deduced amino acid sequence ♦ The data therefore 
suggested that the cloned pilCl gene is out of frame due 
to frameshifting in the region encoding the signal 
peptide . 

25 Figure 2 shows the nucleotide sequence and the 

deduced amino acid sequence of the 5' -end of pllC. The 
amino- terminal sequence of gel purified PilC from 
MBll^CP*) is shown in a box below frame 2, a 997 amino 
acid long open reading frame that would code for a 

30 protein about 110 kd in size. Frame 1 contains 41 amino 
acids and is preceded by a putative Shine -Dalgarno 
sequence (underlined) . Two horizontal lines mark a 
stretch of 12 G residues. An addition of one G in this 
region would align the ATG (boxed) in frame 1 with frame 
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2. Numbers above the sequence show base positions 
relative to the HladIII 4 site (=0) located on pABJ04. 
The position of two 24 bp oligonucleotide primers 
(opposite stands) used for PCR amplification, are 
5 indicated above the sequence by hatched bars. 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
10 amino acids encoded therein. 

noT^H ? jnacfcivat ion of nllO. hut not vllCl 
ahnliahes p v pression of the 13 n krt protein in MS11 

Plasmid pABJ04 was mutagenized in E. coli by a 

15 transposon mini-Tn3 derivative, mTnCm. The shuttle 

mutagenesis system developed by Seifert et al., (1986) 
using a miniTn3 carrying the chloramphenicol resistance 
gene was kindly provided by Dr. M. So. Mutagenesis of 
pABJ04 with mTnCm and transformation of N. gonorrhoeae 

20 were performed as previously described (Seifert et al., 
1990) . MiniTnCm insertions at 30 different positions in 
pABJ04 were identified, two of which mapped within the 
PilC gene. Piliated N. gonorrhoeae MSll^ were 
transformed with 2 jig plasmid DNA, transformants were 

25 selected for on plates containing 10 jig/ml 

chloramphenicol for the single mutants and 30 /ig/ml 
chloramphenicol for the double mutants. 

Only two mTnCm insertions had occurred in 
pilCl (Figure 1) . Truncated protein species were seen in 

30 minicells with the mTnCm-14 insertion located 0.5 kb from 
the 3' end of pilCl but not with the mTnCm-12 insertion 
located 0.5 kb from the 5' end of the gene. Both 
insertion mutants were used in a gene replacement 
eaqjeriment. Plasmids pABJ04: :mTnCm-12 and pABJ04 : :mTnCm- 
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14 were linearized with BamHI and transformed into 
N. gonorrhoeae MSllj^P*) and transf ormants resistant to 
10 /ig/ml of chloramphenicol were selected. Forty- eight 
P + transf ormants (24 from each experiment) were assayed 
5 for the presence of Pile in immunoblots . All these 
transformants remained capable of expressing the PilC 
protein. Genomic DNA was prepared from seven of the 
chloramphenicol resistant transformants (five from 
pABJ04: :mTnCm-12 and two from pABJ04: :mTnCm-14) , cleaved 

10 with Clal and PvuII and used in Southern blot experiments 
using the EcoK^ - EcoRV 2 fragment of pABJ04 as a probe. 
The 8 kb Clal fragment was unaf fected in the mutants 
whereas the 18 kb Clal fragment had been replaced by a 20 
kb fragment. PvuII cleaves within the 1.6 kb mTnCm 

15 element. The probe detected an 8 kb PvuII fragment in 
both parent and mutant DNA. In the mutants, a novel 
PvuII fragment appeared that was 6.2 kb in size in five 
transformants obtained with pABJ04: :mTnCm-12 and 4.8 kb 
in size in two transformants with pABJ04: :mTnCm-14. To 

20 confirm the insertion of mTnCm, a 250 bp EcoRI-Hindlll 
fragment of the CAT GenBlock (Pharmacia, Sweden) , 
containing the PvuII site, was used as a probe. It 
detected the larger of the two Clal fragments as well as 
the 6.2 kb PvuII fragment. In addition, a 2 kb PvuII 

25 fragment not covered with the pile probe was detected. 
These data demonstrate that we have obtained gene 
replacements in pilCl, whereas pilC2 was unaf fected in 
all seven P + , PilC + transformants. 4 A rapid hybridization 
was done to screen the remaining 41 P + transformants. 

30 All but one had mTnCm inserted in pilCT. The remaining 
transformant had an intact locus 1 and 2 and must 
therefore contain mTnCm elsewhere in the gonococcal 
chromosome . 
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In the same transformation experiments , the 
frequency of P" colony variants was about five- fold 
higher as compared with that occurring normally in strain 
MSll mk (P + ) . Two P'mTnCm-12 transfonnants isolated at 10 
5 /ig/ml of chloramphenicol were also analyzed by Southern 
blot hybridization using the EcoH^ - EcoRv 2 fragment of 
pABJ04 and the EcoRl - Hindlll fragment of the CAT 
GenBlock. Each of these mutants carried mTnCm in pilC2 
as evidenced by a replacement of the 8 kb Clal fragment 

10 by a fragment 9.5 kb in size that hybridizes to both 
probes. These pilC2 : rmTnCm insertion mutants did not 
express PilC as determined by immunoblot analysis. 

A p + , pilCli :mTnCm- 12 mutant was retransf ormed 
with DNA prepared from a P~, pilC2i :mTnCm- 12 mutant and 

15 colonies growing at 30 /ig/ml of chloramphenicol were 

selected to obtain double mutants in pilC. All resistant 
transfonnants were P", and when analyzed by Southern blot 
hybridization all contained mTnCm in both pllCl and 
pilC2. Electron microscopy revealed that the P , 

20 piICl::mTnCm-12 mutant still expressed pili albeit at a 
slightly lower level than the MSll mk (P + ) parental clone, 
whereas the P~, pilC2: rmTnCm- 12 was completely bald as 
was the pilCl, p±lC2 double mutant. 

Immunoblot analyses were performed on the P 

25 pilCli tmTnCm- 12 mutant, the P", piIC2: jmTnCm-12 mutant 
and the P", pilCl: tmTnCm- 12, pilC2: :mTnCm- 12 double 
mutant, using PilC and pili antisera. Inactivation of 
pilCl did not abolish expression of PilC or the pilin. 
Inactivation of pilC2 totally abolished expression of 

30 PilC but did not affect expression of pilin. The pilCl, 
pilC2 double mutant was PilC" but produced only low 
levels of pilin. Taken together these data imply that 
pilC2 but not pilCl is expressing PilC in the MS11 
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variant under study. Moreover, inactivation of pilC2 but 
not pilCl was associated with a loss of piliation. 

P + revertants occurred spontaneously at a low 
frequency in the pilC2z :mTnCm- 12 mutants. These 
5 revertants expressed pili as determined by electron 
microscopy and also expressed PilC. It is likely that 
PilC expression is due to in- frame switching in pilCl. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
10 shift on the putative gene products encoded therein. 

The pile genes of N. gonorrhoeae vary 

in the length of the G tract 
Polymerase ch ain reaction (PGR) with Tag 
15 polymerase was used to analyze the 5' region of pilC 

using two 24 base long synthetic oligonucleotides based 
on the sequence of pilCl (Figure 2) . These 
oligonucleotides would generate an amplified fragment of 
149 bases as judged from the sequence obtained from 
20 pABJ04. 

Polymerase chain reaction was carried out in 
100 fil containing 50 ng of genomic DNA or 5 ng of plasmid 
DNA. 1.0 /xM of each oligonucleotide, 200 fiM of each 
nucleotide, 0.001% gelatin, 1.5 mM MgCl 2 , 10 mM Tris (pH 

25 8.3), 50 mM KC1, 0.25 fil 1 mCi/ml [ 32 P]dATP and 2 U of 
Taq Polymerase (Perkin Elmer Cetus) . The samples were 
passed through 25 cycles: 2 min at 50°C, 1 min at 94 °C 
and 3 min at 72 °C in a Thermal Cycler (Perkin Elmer 
Cetus) . Aliquots of the DNA fragments were denatured at 

30 95°C for 2 min and electrophoresed on standard denaturing 
sequencing gels. 

The amplified products from MSll^CP*) DNA were 
149 and 150 long respectively. In addition, two less 
abundant products of 151 and 148 bases were seen. The 
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amplif ied products were electroeluted and cloned into 
M13mpl8, and twenty phage clones were sequenced using a 
universal primer. Four different sequences were obtained 
(Figure 6) . 

5 Figure 6 shows the nucleotide sequence of PCR 

amplified fragments demonstrating a variation in the 
length of the 6 tract and sequence differences in the 5'- 
region of the pilC genes. The two oligonucleotide 
primers used for the PCR are shown in Figure 2. 

10 Amplified DNA was cloned into M13mp8 and sequenced. 

Shown are the complete nucleotide sequence in between the 
two primers. In- frame sequences are translated and the G 
stretches are underlined. The putative cleavage sites 
are marked with arrows. Genomic DNA from N. gonorrheae 

15 strains MS11 (P + .PilC + ) , UM01 (P + .PilC + ) , 765 (P + .PilC + ) 
and 605103 (P'n.PilC), and purified DNA from 
pABJ04/AA10 (recA) was used in the PCR. 

Variant patterns la and lb were identical to 
each other and to the cloned sequence on pABJ04 except 

20 for the presence of 11 instead of 12 G residues in the G 
tract of lb. The G tract of sequence 2a was 13 residues 
long indicating that the sequence is in frame. In 
addition, this sequence differed from pilCl by four 
basepair substitutions outside the G tract, including an 

25 AAA lysine codon four triplets downstream of the putative 
signal peptide processing site which is in agreement with 
the lysine residue found in the fourth position of the 
gel purified 110 kd PilC protein. Sequences la and lb 
contained CAA, the codon for Gin, at the same position. 

30 Sequence 2b was identical to 2a except for the presence 
of 12 G residues in the G tract. These data are 
compatible with sequence 1 being from pilCl and sequence 
2 from pilC2 and further support that pilC2 must be the 
expressed locus in the MS11(P + ) variant we are studying. 
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Strain UM01 apparently only contains one copy 
of pile. DNA from this strain generated five amplified 
fragments ranging in size from 148 to 152 bp in the PCR 
reaction. The most abundant fragments were 149-151 bp 
5 long. Among ten M13 clones, three variant 1 sequences 
were found (a,b,c) that differed only in the number of G 
residues (11-13) in the G tract (Figure 6) supporting the 
hybridization data that this strain contains only one 
pilC gene. Since a PilC protein is expressed from UM01 
10 we suggest that the majority of cells has 13 Gs in the G 
tract . 

Strain 765 contains two pilC loci, both of 
which seem to be trans lationally ON based on the presence 
of two high molecular weight proteins reacting with the 

15 absorbed PilC antiserum. A number of amplified fragments 
were seen after the PCR reaction ranging in size from 149 
to 153 bases. Three variant sequences were found among 
nine clones (Figure 6). The G tract was 13 residues long 
in variant 3a (in frame) and 14 (out of frame) in variant 

20 3b whereas sequence variant 4 contained 11 G residues in 
the G tract. Variant sequence 4 contained four 
additional nucleotides (-CAGG-) distal to the G tract 
relative to variant sequences 1, 2 and 3, indicating that 
the amplified product with 11 Gs from this variant 

25 sequence is 152 long and out of frame. Two PCR amplified 
products 152 and 153 in length were obtained from strain 
765 suggesting that in frame variants of sequence 4 might 
be present in the DNA prepared from this strain. 

Strain 605103 carries two pilC copies, both of 

30 which seem to be translationally OFF. The amplified 

fragments were 148 and 149 bases in size. Out of eight 
M13 clones only variant la and lb sequences were found, 
with 11 and 12 Gs in the G tract respectively. 
Consequently, we were unable to find an in frame sequence 
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variant from this strain. We do not know if the 5' ends 
of the two pile genes are identical in this strain or if 
one pile gene differed from pilCl in the region 
corresponding to the oligonucleotides used for 
5 amplification. In the latter case we would not expect to 
obtain any amplified products from the second copy. 

The only in frame variant found in DNA 
amplified from N. gonorrhoeae carried 13 Gs in the G 
tract. To see if variants with lOGs arise in products 

10 expressed in E. coli, PCR amplified products were 

generated from pABJ04 purified from E. coli strain AA10, 
using the same two oligonucleotide primers as before. 
Out of 12 sequenced clones, two carried 10 Gs in the G 
tract (Figure 6) . The majority of clones (seven) carried 

15 12 Gs as expected. It is likely that the PCR 

amplification products are not representative of the 
original DNA population. However, the distribution of 
variation in the G tract is consistent with a model in 
which only one G residue is gained or lost at one given 

20 event. Since AA10 is recA, frameshift mutations in the G 
tract in E. coli occur independently of the RecA protein. 

N. meningitidis contains two pilC loci 
Southern blot hybridizations using MSll pilCl" specific 

25 probes identified multiple fragments when meningococcal 
genomic DNA is digested with a variety of restriction 
endonucleases . PCR amplification using two 24 -base 
oligonucleotides from the 5' end of MSll pilCl as primers 
yields multiple fragments ranging in size from 148 to 151 

30 bases. DNA sequencing of fragments cloned into phage M13 
identifies two classes of sequences, as in IT. 
gonorrhoeae, which differ outside the G- tract. Variation 
occurred within each class with respect to the number of 
G's in the G- tract. Therefore, N. meningitidis must 
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carry two pilC loci, as does N. gonorrhoeae, which should 
also be under the control of translational frame 
shifting. 

Both pilC loci are cloned from Iff. meningitidis 
5 by generating an EMBL3 library and screening this library 
with pilCl' and pilC2~ specific DNA from N. gonorrhoeae 
strain MS11. 

Genomic DNA from N. meningitidis is partially 
digested with Sau3A and fragments ranging from 9 to 20 kB 

10 are ligated into the lambda EMBL3 vector . Because of the 
packaging constraints of the phage, only those lambdas 
which contain DNA fragments of this size will be packaged 
(i.e., are viable). The library thus constructed can be 
screened with genomic oligonucleotide or cloned gene 

15 probes following selection in a lysogenic E. coli strain. 
(Frischart, A.M. et al (1983), J. Mol. Biol. 170:827). 
Preferably, full length clones are identified by 
screening for clones hybridizing to both the 5' and 3' 
ends of pilC. If full length clones cannot be obtained 

20 from the EMBL3 library, pilC specific probes may be used 
to screen a plasmid library from the same strains. 

Translational fusion proteins with j8- 
galactosidase may also be screened for in a Xgtll 
library, using 0-galactosidase and PilC" specific 

25 antisera in Western immunoblots . j8-galactosidase-PilCl 
and 0-galactosidase-PilC2 fusion proteins are purified 
from the cytoplasm of recombinant B. coli and used to 
raise specific antisera. 

30 Example 7 

The immunobioloaical properties of PilC 
PilC is located in the outer membrane of 
Neisseria. The immune response during natural infection 
can be assessed by screening convalescent sera for anti- 
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10 



PilC antibody. The presence of only two pilC loci 
suggests that PilC is only moderately variable, however. 
This together with its essential role in pilus biogenesis 
makes PilC attractive as a potential vaccine candidate. 

Two types of PilC translational fusions using 
alkaline phosphatase and 0-galactosidase are generated. 
In the first instance a secreted fusion protein is 
obtained that may associate with the outer membrane. In 
the second instance the fusion proteins may accumulate in 
the cytoplasm as inclusion bodies. The construction 
schemes for such fusion proteins uses techniques known in 
the art. TnphoA insertions on plasmid pABJ04 in E. coli 
are generated, and a PhoA + phenotype is screened for as 
blue colonies on media containing the chromogenic 
15 substrate XP. If such clones have the phoA gene in frame 
with an in frame variant of pilCl the fusion product 
should be able to cross the cytoplasmic membrane where it 
can be analyzed by Western immunoblots using an alkaline 
phosphatase specific antiserum and our PilC antiserum 
20 raised against gel-purified PilC2 from MS11 (P + ) . 

LacZ: zpilC fusions are generated by cloning different 
segments of pilC into a lacZ" containing vector used to 
generate translational fusions. Similar constructs are 
performed on each of the two pilC genes from N. 
25 meningitidis. Antisera are generated against fusion 
proteins after their purification using conventional 
protocols. These antisera are extensively adsorbed with 
extracts of E. coli expressing alkaline phosphatase and 
0-galactosidase, and used in Western immunoblots and 
30 ELISA assays against a panel of Neisseria gonorrhoeae and 
Neisseria meningitidis strains. Antisera raised against 
fusion proteins carrying the major portion of PilC are 
also analyzed in Western blots using E. coli expressing 
fusion proteins containing only smaller regions of PilC. 
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The results of these studies should show which regions in 
PilC are immunodominant. 

The pilCl and pilC2 genes are highly homologous in their 
5' ends whereas the homology is considerably less 
5 pronounced in the central and 3' region. 

In addition, the entire pilC2 gene from N. 
gonorrhoeae MS11(P + ) is cloned and sequenced. Algorithms 
are used to search for potential T-cell epitopes 
(amphipathic helical conformation) and jS-cell epitopes. 

10 Polypeptides containing the predicted epitopes are tested 
to determine if they can prime mice for an e nh a n ced 
immune response to PilCl and PilC2. 

Specific PilC antisera are used in 
immunoelectromicroscopy with piliated Neisseria cells as 

15 well as with purified pili to see if PilC is physically 
connected with the pilus fiber. 

Neisseria is grown in the presence of different 
dilutions of PilC" specific antibodies. Bactericidal 
effect exerted by the antiserum, effects on piliation, 

20 and effects on bacterial attachment to corneal primary 
culture cells are monitored. Binding assays to 
epithelial cells are described in Tjia, K.F. et al. 
(1988), Graefe's Arch. Clin. Exp. Opthalmol. 226:341- 
345. 



25 



)l9 8 

Identification and characterization of genes 
located adjacent to pilC 

30 The pilCl and pilC2 loci are part of a larger duplication 
that extends both 5'- and 3'- of pilC. We know from our 
work with E. coli that strains may contain multiple gene 
clusters for the same class of pili. In one case we have 
shown that the only difference between two duplicated 
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gene clusters (pap and prs) resides in the adhesion genes 
such that each cluster gives rise to serologically 
identical pili binding to different cell surface 
receptors . 

5 mTncm mutagenesis in the region upstream and 

downstream of pilCl is performed to generate allelic 
replacements in the pilCl and pilC2 regions on the 
chromosome. Since the two regions are highly homologous 
we expect to obtain for each insertion allelic 

10 replacements in either region. Double mutants are 
generated as before by isolating DNA from mutants 
carrying insertions in the pilC2 region transforming P + 
variants carrying the same insertion in the pilCl region 
and select for transformants resistant to 30/ig/ml of 

15 chloramphenicol. These double mutants are examined for 
piliation, pilins expression, and binding to corneal 
primary culture cells. 

Example 9 

20 Ph ? fie> variation in conoco rral pIU expression 

ran h» ranged by framese t imitations in P&IG 
If Pile is required for pilus formation, we 
would expect some P" progeny arising from a P + clone to 
accumulate unassembled pilin in the absence of PilC. 
25 Nonpiliated (P~) colonies were derived from MS 11^ (P ), 

restreaked, and tested for the presence of PilC and pilin 
in immunoblots with the PilC and pili antisera. Five out 
of eight P" clones did not produce detectable levels of 
PilC, but expressed the pilin subunit. The remaining 
30 three P" clones expressed PilC but not pilin. The 

molecular mass of the pilin subunit was the same in the 
P", PilC" variants as in MS 11^ (P + , PilC + ) . However, the 
former in addition produced a protein reacting with the 
pili antiserum that was 16 kd in size. Since MS 11^ only 
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contains one expression site for pilin we believe that 
this protein species represents a proteolytic degradation 
product of the pilin and nay be identical to the S -pilin 
previously described (Haas et al., 1987). Several 
5 independent P" clones were isolated from one P", PilC" 
clone. They all remained PilC" and retained expression 
of pilin. Piliated (P + ) revertants were also obtained 
from the same P" , PilC" clone. These P + revertants 

-4 

occurred at about a tenfold lower frequency (10 ) than 

10 P" derivatives from a P + clone. All P + revertants from a 
P", PilC" clone had regained expression of PilC. All but 
one expressed a pilin with the same molecular weight as 
the nonpiliated parent. However, the low molecular 
weight pilin degradation product was much less abundant 

15 in the P + , PilC" revertants. It was possible to obtain 
P + revertants from other P", PilC" clones as well, all of 
which expressed PilC. 

The pilE gene from one set of PilC switches was 
PCR amplified and sequenced directly. The P", PilC" , 

20 pilin + variant 8 carried eight amino acid changes in the 
pilin relative to the parental clone MSll^^ The pilin 
sequence of the P + , PilC" backswitcher 8:1 was identical 
to variant 8. Thus, the backswitching from P'to P + 
colony morphology was not associated with any alteration 

25 in the pilus subunit protein implying that the change in 
colonial morphology was due to the switch in PilC 
expression. 

Strain MSll^P"), variants 8(P") and 8:1 (P + ) 
were also examined by transmission electron microscopy. 
30 Electron microscopy was performed with a JEOL 10 OCX 

microscope with 2 00 -mesh copper grids coated with thin 
films of 2% Formvar. The bacterial colonies were 
carefully overlaid with buffer [10 pM Tris-HCl (pH 7.5), 
10 /iM magnesium chloride] and the cells were allowed to 
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sediment for 15 min on a grid. The grids were washed 
with water, negatively stained with 1% sodium 
silicotungstate (pH 7.0) and then washed again. 

The electron micrographs showed that the 
5 MS11(P + ) parental cells were heavily piliated and pili 
were of ten seen to aggregate. In contrast most cells of 
variant 8(P") were nonpiliated. One or two pili were 
found on -10% of these cells. All cells of variant 8:1 
(P + ) were piliated, carrying -10-40 fibers/cells. No 

10 aggregation of individual fibers was seen. These data 
confirm that the observed changes in colonial morphology 
reflect alterations in expression of pili. Therefore, 
phase variation of gonococcal pili may not only be caused 
by recombination events occurring in the pilE locus 

15 (Bergstrom et al, .1986; Swanson et al., 1986) but also by 
frameshift mutations in pllC. 



revam p! P. 10 
3jnmnunoaenn rity of PilC 

20 In order to predict a region of PilC which 

would have a high probability for antigenicity, residues 
300 to 700 of the putative PilCl protein encoded within 
pilCl were analyzed for antigen index, hydrophilicity, 
and hydrophobicity using standard computer-modelling 

25 methods. The analysis indicated that the PilCl 

polypeptide fragment containing residues 300 to 700 would 
have several regions with a high antigen index, high 
hydrophilicity, and a high likelihood for location in an 

external domain. 
30 The immunogenicity of a recombinant polypeptide 

expressed from the DNA encoding amino acids 300 to 700 
was examined. The region of DNA encoding amino acid 
residues 300 to 700 was amplified by polymerase chain 
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reaction (PGR) using the following oligonucleotide 
primers . 

5' GGC TAG GTG GCA TAT GAA AGA TAG CGG 3' 
5 and 

5' TTT GCA ATC GG£ GAT CCT * C*A*G GTG TCT TTC 3' 

These primers incorporate an Ndel and a BamHI restriction 
endonuclease site (indicated by the underlined 

10 nucleotides) , respectively. A termination codon 

(indicated by the asterisks) was also incorporated. The 
PCR amplif ied DNA was then ligated into the vector pET3a 
(between the Ndel and BamHI sites) . The recombinant 
vector pET3a is used in the inducible expression system 

15 described by Studier et al. (1990), using the protocol 
described therein. Strain BL21 (DE3) was transformed 
with the pET3a-piIC (300-700) vector, and the transformed 
strain used for the expression of the PilC (300-700) 
peptide . 

20 The expression products after induction were 

analyzed by sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis (PAGE) using an 8% acrylamide gel and the 
standard 25 mM Tris base / 250 mM glycine / pH 8.3 / 0.1% 
sodium dodecyl sulfate (SDS) electrophoresis buffer. 

25 After electrophoresis, the gel was fixed and then stained 
with Coomassie Blue according to standard protocols, and 
the production of PilC (300-700) was confirmed by the 
detection of the presence in the gels of an abundant, 
appropriately- sized peptide of approximately 46kD. 

30 In order to detect the immunogenicity of the 

PilC(300-700) product, the region of the SDS-PAGE gel 
containing the PilC (300-700) polypeptide was excised from 
parallel unstained lanes, homogenized, and the protein 
eluted into a buffer of 0.1% Triton X-100 in water by 
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passive diffusion. Rabbits were given a priming 
intradermal injection of homogenized gel slices 
(containing approximately 500/tg PilC(300-700) protein), 
followed 3 weeks later by a subcutaneous boost (of 
5 approximately 500 fig of eluted protein) . An initial test 
serum was then collected after an additional 14 days. 
All of the test animals yielded a specific high titer 
antibody response. The antibodies induced by PilC (3 00- 
700) were immunologically reactive not only with that 

10 polypeptide (i.e., PilCl (300-700) ) , but also with native 
PilCl and native PilC2. 

The results demonstrate, inter alia, the 
following. PilC contains antigenic epitopes that can 
elicit a strong immunogenic response. At least some of 

15 the immunogenic epitopes are shared (cross -reactive) 

between PilCl and PilC2, despite differences in primary 
amino acid sequence. The technique of subcloning 
discrete portions of the PilC protein under control of an 
inducible promoter allows mapping of antigenic epitopes. 

20 Sufficient quantities of specific oligopeptides of known 
antigenicity can be produced for use in screening the in 
vivo immune response after exposure to the intact 
pathogen; 

The following listed materials are on deposit 
25 under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC) , 12301 Parklawn Dr. , 
Rockville, Maryland 20852, and have been assigned the 
following Accession Numbers. 

30 Description ATCC No. Deposit Date 

pABJ03 in E. coli (DH5) 68519 Jan. 28,. 1991 

pABJ04 in E. coli (DH5) 68520 Jan. 28, 1991 
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Upon allowance and issuance of this application 
as a United States Patent, all restriction on 
availability of these deposits will be irrevocably 
removed; and access to the designated deposits will be 
5 available during pendency of the above-named application 
to one determined by the Commissioner to be entitled 
thereto under 37 CFR 1.14 and 35 USC 1.22. Moreover, the 
designated deposits will be maintained for a period of 
thirty (30) years from the date of deposit, or for five 

10 (5) years after the last request for the deposit; or for 
the enforceable life of the U.S. patent, whichever is 
longer. The deposited materials mentioned herein are 
intended for convenience only, and are not required to 
practice the present invention in view of the 

15 descriptions herein, and in addition these materials are 
incorporated herein by reference. 



Industrial Applicability 
The invention, in the various manifestations 

20 disclosed herein, has many industrial uses, some of which 
are the following. The pilC DNAs may be used for the 
design of probes for the detection of pilC nucleic acids 
in samples. The probes derived from the DNAs may be used 
to detect pilC nucleic acids in, for example, chemical 

25 synthetic reactions. The polynucleotide probes are also 
useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of pathogenic 
microorganisms containing type 4 pilin, for example, 
gonococcal and/ or meningococcal infections in humans. 

30 In addition to the above, the DNAs provided 

herein provide information and a means for synthesizing 
polypeptides containing epitopes of PilC. These 
polypeptides are useful in detecting antibodies to PilC 
antigens. A series of immunoassays the relevant 
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10 



neisserial infection, based on recombinant polypeptides 
containing pilC epitopes are described herein, and will 
find commercial use in diagnosing diseases caused by 
these microorganisms. In addition, the polypeptides 
derived from the pilC DNAs disclosed herein will have 
utility as vaccines for treatment of infections caused by 
meningococci and gonococci. 

The polypeptides derived from the pilC DNAs, 
besides the above stated uses, are also useful for 
raising anti-PilC antibodies. Thus, they may be used in 
vaccines against the relevant microorganisms. Moreover, 
the antibodies produced as a result of immunization with 
the polypeptides containing an immunoreactive PilC 
epitope are also useful as passive vaccines, or in the 
15 detection of the presence of PilC antigens in samples. 
Thus, they may be used to assay the production of 
polypeptides derived from PilC in chemical systems. The 
anti-PilC antibodies may also be used to monitor the 
efficacy of ant i -neisserial agents in screening programs 
20 where these agents are tested in tissue culture systems. 
Another important use for anti-PilC antibodies is in 
affinity chromatography for the purification of PilC 
derived polypeptides. The purified PilC polypeptide 
preparations may be used in vaccines. 
25 For convenience, the ant i- PilC antibodies and 

polypeptides containing regions encoded in pilC, whether 
natural or recombinant, may be packaged into kits. 
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CLAIMS 



1. A recombinant polynucleotide encoding a 
polypeptide comprised of an immunoreactive epitope of a 

5 protein encoded in pilC of Neisseria. 

2. The recombinant polynucleotide of claim 1, 
wherein the immunoreactive epitope is encoded in pilC of 
Neisseria gononrheae. 

10 

3. The recombinant polynucleotide of claim 2, 
wherein the immunoreactive epitope is encoded in pilC2 of 
Neisseria gonorrheae. 



15 4. The recombinant polynucleotide of claim 2, 

wherein the immunoreactive epitope is encoded in pilCl of 
Neisseria gonorrheae. 

5. A vector comprised of a recombinant 

20 polynucleotide, wherein the recombinant polynucleotide is 
selected from the group consisting of the recombinant 
polynucleotide of claim 1, the recombinant polynucleotide 
of claim 2, the recombinant polynucleotide of claim 3, 
and the recombinant polynucleotide of claim 4. 

25 

6. A host cell transformed with the vector of 

claim 5, 



7. A recombinant expression system comprising 
30 a polynucleotide encoding a polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pilC of 
Neisseria, wherein the polynucleotide is operably linked 
to a control sequence compatible with a desired host. 
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8. The recombinant expression system of claim 

7, wherein the imniunoreactive epitope is encoded in pilC 
of Neisseria gonorrheae. 

9. The recombinant expression system of claim 

8, wherein the immunoreactive epitope is encoded in pilCl 
of Neisseria gonorrheas. 

10. The recombinant expression system of claim 
8, wherein the immunoreactive epitope is encoded in pilC2 
of Neisseria gonorrheae. 



11. A cell transformed with a recombinant 
expression system, wherein the expression system is 
15 selected from the recombinant expression system of claim 
7, the recombinant expression system of claim 8, the 
recombinant expression system of claim 9, and the 
recombinant expression system of claim 10. 

20 12. A polypeptide produced by the cell of 

claim 11. 

13. A purified polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pilC of 

25 Neisseria. 

14. A purified polypeptide according to claim 
13, wherein the immunoreactive epitope is encoded in pilC 
of Neisseria gonorrheae. 



15. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilC2 of Neisseria gonorrheae. 
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16. A purified polypeptide according to claim 
14 f wherein the imnrunoreactive epitope is encoded in 
pilCl of Neisseria gonorrheae. 

5 17. A recombinant polypeptide comprised of an 

immunoreactive epitope of a protein encoded in pilC of 
Neisseria. 

18. A recombinant polypeptide according to 
10 claim 17, wherein the immunoreactive epitope is encoded 
in pile of Neisseria gonorrheae. 



19. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 

15 in pilC2 of Neisseria gonorrheae. 

20. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 
in pilCl of Neisseria gonorrheae. 

20 

21. A method of preparing a recombinant 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pilC of Neisseria, the method 
comprising: 

25 a. providing a host cell according to claim 

11; 

b. incubating the cell under conditions which 
allow expression of the recombinant polypeptide; and 

c. isolating the polypeptide. 



30 



35 



22. A vaccine composition for the treatment of 
Neisseria infection, comprised of a pharmaceutical^ 
acceptable excipient and of an effective amount of a 
recombinant polypeptide, wherein the polypeptide is 
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comprised of an immunoreactive epitope of a protein 
encoded in pilC of Neisseria.. 

23. The vaccine composition of claim 22, 

5 wherein the immunoreactive epitope is encoded in pilC of 
Neisseria gonorrheae. 

24. A polypeptide affixed to a solid 
substrate, wherein the polypeptide is selected from the 

10 group consisting of the polypeptide of claim 12, the 

polypeptide of claim 13, the polypeptide of claim 14, the 
polypeptide of claim 15, the polypeptide of claim 16, the 
polypeptide of claim 17, the polypeptide of claim 18, the 
polypeptide of claim 19, and the polypeptide of claim 20. 

15 

25. An immunoassay for detection of anti- 
Neisseria. antibodies comprising: 

(a) providing a sample suspected of containing 
anti -.Neisseria antibodies; 

20 (b) providing an antigen, wherein the antigen 

is a polypeptide selected from the group consisting of 
the polypeptide of claim 12, the polypeptide of claim 13, 
the polypeptide of claim 14, the polypeptide of claim 15, 
the polypeptide of claim 16, the polypeptide of claim 17, 

25 the polypeptide of claim 18, the polypeptide of claim 19, 
the polypeptide of claim 20; and 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 
of antibody- antigen complexes; and 

30 (d) detecting the presence of ant i -Neisseria 

antibody- antigen complexes formed in (c) , if any. 



35 



26. A composition comprised of a polypeptide, 
wherein the polypeptide is selected from the group 
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consisting of the polypeptide of claim 17, the 
polypeptide of claim 18, the polypeptide of claim 19, and 
the polypeptide of claim 20. 



5 27 . A composition comprised of purified 

polyclonal anti-PilC antibodies, wherein the PilC is of 
Neisseria. 



28. A composition comprised of a monoclonal 
10 antibody directed against an imrrtunoreactive epitope 

encoded in pile of Neisseria. 

29. An immunoassay for detection of an antigen 
encoded in pile of Neisseria comprising: 

15 (a) providing a sample suspected of containing 

an antigen encoded in pile of Neisseria; 

(b) providing a composition comprised of 
antibodies directed against the antigen encoded in pile 
of Neisseria, wherein the composition is selected from 

20 the group of compositions of claim 27 and claim 28; 

(c) reacting the sample of (a) with the 
antibody containing composition of (b) under conditions 
which allow the formation of anti-PilC antibody- antigen 
complexes ; 

25 (d) detecting anti-PilC antibody- antigen 

complexes formed in (c) , if any. 



30. A kit for analyzing samples for the 
presence of anti-PilC antibodies comprising: 
30 (a) an antigen packaged in a suitable 

container, wherein the antigen is a polypeptide selected 

from the group consisting of the polypeptide of claim 12, 

the polypeptide of claim 13, the polypeptide of claim 14, 

the polypeptide of claim 15, the polypeptide of claim 16, 
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the polypeptide of claim 17, the polypeptide of claim 18, 
the polypeptide of claim 19, and the polypeptide of claim 
20; 

(b) a buffer used in the performance of the 
analysis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 
(b). 



31. A kit for analyzing samples for the 
presence of an antigen comprised of an immunoreactive 
epitope encoded in pile of Neisseria comprising: 

(a) a composition comprised of antibodies 
directed against the antigen comprised of an 

15 immunoreactive epitope encoded in pile of Neisseria, 
wherein the composition is selected from the group of 
compositions of claim 27 and claim 28, wherein the 
composition is packaged in a suitable container; 

(b) a buffer used in the performance of the 
analysis, packaged in a suitable container; and 

(c) instructions for performing the analysis. 



32. A method for producing antibodies to Pile 
of Neisseria comprising administering to an individual a 
25 composition comprised of an isolated immunogenic 
polypeptide containing a PilC epitope in an amount 
sufficient to produce an immune response to the PilC 
epitope. 

30 33. An oligomer capable of .hybridizing to a 

sequence in pile of Neisseria, wherein the oligomer is 
comprised of a pile sequence complementary to at least 
about 6 contiguous nucleotides of pile. 
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34. An oligomer according to claim 33, wherein 
pilC is of Neisseria gonorrheae. 

35. A process for detecting a pilC sequence in 
5 an analyte strand, wherein the pilC sequence comprises a 

selected target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pilC 
sequence; 

10 (b) providing an oligomer capable of 

hybridizing to the target pilC sequence, wherein the 
oligomer is comprised of a pilC targeting sequence . 
complementary to at least about 6 contiguous nucleotides 
of pilC; 

15 (c) incubating the sample of (a) with the 

oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

(d) detecting hybrids formed between the 

20 target sequence, if any, and the oligomer. 

36. - The process of claim 35 which further 

comprises : 

(a) providing a set of oligomers which are 

25 primers for a polymerase chain reaction (PCR) method and 
which flank the target region; and 

(b) amplifying the target region via the PCR 

method. 

30 37. A kit for detecting a pilC sequence in an 

analyte strand comprising: 

(a) the oligomer of claim 33, packaged in a 
suitable container; 
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(b) a suitable buffer, packaged in a suitable 

container; and 

(c) instructions for performing the detection. 

38. A recombinant polynucleotide comprising a 
DNA sequence of at least 8 contiguous nucleotides from 
pilC F wherein the pile sequence is selected from the 
group of sequences shown in Figure 3, Figure 6, and 
Figure 7. 



39 . A method of treating an individual for a 
Neisseria infection comprising administering to the 
individual antibodies produced according to claim 32 , 
wherein the antibodies are administered in an amount 

15 effective to prevent the pathology of the infection.. 

40. An immunoassay for detection of anti- 
Neisseria antibodies comprising: 

(a) providing a sample suspected of containing 
20 anti-Neisseria antibodies; 

(b) providing an antigen, wherein the antigen 
is the polypeptide of claim 24; 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 

25 of antibody-antigen complexes; and 

(d) detecting the presence of anti-tfeisseria 
antibody-antigen complexes formed in (c) , if any. 

41. A kit for analyzing samples for the 
30 presence of anti-Pile antibodies comprising: 

(a) an antigen packaged in a suitable 
container, wherein the antigen is a polypeptide from 
claim 24; 
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(b) a buffer used in the performance of the 
anaylsis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 

5 (b). 
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10 30 50 

GATCCGCCCGG7GCTTGGGGCCCTTAGGGAACCGTTCCCTTTGAGCCGGGGCGGGGCAAC 

70 90 "0 

GCGTACCGGTTTT7GTTAATCCGCTATAAAAGGCGGGCTATAGGGTAGGCTTCATCCTC 

130 150 "0 

CCAATCTCACTGAATCCGTCAATTTCCGCAATTCAATTAAATACCGTCAAACCGATGCCG 

190 210 230 

TCATTCCGCGCAGGCGGGAATCCGGACCGGTCGGGCATCTGCGGCGGTTTGCTAAAAAAC 

250 270 290 

GCTTTACCGTGATAAGTGCGCAAAGTTAAAATGGGGAGGTAAGCTTTTCAATCAGCAATC 

310 330 350 

CGGCGGGCGCGGAATCGGGCGGTXTACCGAACCCCGGCGTTCGCGGCGCCCGTCCCGCGA 

370 390 410 

AGGCAAACTTAAGGAATAAAATATOAATAAAACTTTGAAACGGCAGGTTTTCCGCCATAC 

430 450 470 

CGCGCTTTATGCCGCCATCTTGATGTTTTCCCAtTACCGGCGGGGGGGGGGGGCGATGGCG 

490 510 S30 

CAJUiCCCATCAATACGCTATTATCATGAACGAGCGAAAreAGCCCGAGGTAAAGCAGAAT 

550 570 590 

GTGCCXTCTTCJmTAAAGGACAAAGACAGGAGGCGCGAATATACTTATTATACGCACAGA 

610 630 €50 

ACAGGAG6AGGCTCTGTC7CATTCAACAATAACGATACCCTTGTTTCCCAACAAAGCGGT 

670 690 * 710 

ACTGCCGTTTTTGGCACAGCCACCTACCTGCCGCCCTACGGCAAGGTTTCCGGTTTTGAT 

730 750 770 

GCCGTCGCTCTGAAAGAGCGCAACAATGCCGTTGATTGGATTCG2ACCACCCGCAXCGCG 

790 610 830 

CTGGCAGGCTACTCCTACATCGACGTCATATGCAGAAGCTACACAGGCTGTCCCAAACTT 

850 870 890 

GTCTATAAAACCCGATTTACCTTCGGTCAACAAGGGTTGAAAAGAAAGGCAGGCAGCAAG 

910 930 950 

CTGGATATATACGAAGACAAAAGCCOCGAAAATTCGCCCATTTACAAATTGTCGGATTAT 

970 990 1010 

CCTTGGTTGGGCGTATCTTTCAATTTGGGCAGCGAGAATACCGTCCAAAATAGCAAATTA 

1030 1050 1070 

TTCAACAAATTGATATCTTCTTTTAGAGAAGGCAATAATAATCAAACCAXCGTCTCTACG 

1090 1110 1130 

1150 1170 H90 

TATCTGAACGCCAAACTGCACCTGCTGGACAAAAAAGGGA1TGAAGATATCGCCCAAGGC 

1210 1230 3250 

AAAATAGTGGATTTGGGTATCTTGAAACCGCACGTCGAGACGACAGGACGAAGCTTGCTA 

1270 1290 1310 

OATTTTTGGGCTAGGTGGGACATTAAAGATACCGGGCAGATTCCGGXCAAGCTCGGCCtrG 

1330 1350 1370 

CCGCAAGTCAAAGCAGGCCGC7GCACCAACAAACCGAACCCCAATAATAATACCAAAGCC 

1390 1410 1430 

CCTTCGCCGGCACTGACCGCCCCCGCGCTGTGGXTCGGACCCGGGCAAGATGGTAAGGCG 

1450 1470 1490 

GAGATGTATTCCGCTTCGGTTTCCACCTACCCCGACAG1TCGAGCAGCCGCATCTTCCTC 

1510 1530 1550 

CAAGAGCTGAWVACTCAAACCGAACCCGGCAAACCCGGCCGCTATTCCCTCAAATCTTTG 

1570 1590 1610 

AATGATGGTGAGATTAAAAGTCGACAGCCGAGTTTCAACGGGCGGCAAACAATCATCCGA 

1630 1*50 1670 

TTGGATGACGGCGTACATTTGATCAAACTGAATGGAAGCAAGGATGAGGTCGCCGCTTTT 

1690 1710 3730 

GTCAATTTAAATGGAAACAACACCGGCAAAAACGACACTTTCGGCATTGTTAAGGAAGCG 

1750 1770 1790 

AACGTCAATCrTGACGCCGACGAGTGGAAAAAAGTGCTGCTGCCTTGGACGGTTCGGGGT 

1010 1B30 1550 

CCCGATAATGACAATAAATTTAAATCAATTAACCAAAAACCAGAAAAATACAGCCAAAGA 

1870 1890 1910 
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WCCGCMCCGCGACAAC^^ 

1950 109A 

OTCQCQOICGOCGGGTATTTGCCAACCGCCGCQAACOACQOaikTQOTQCATATCTTCAAA 
Q 2010 5ftift 

'Ujy 2070 9noA 

.TCCCSCGC^OGATATTCAAAGCCAMJUVICCACCCTTGCCIUU^AGCTGCGCGCCTST 
6 CC S AAAAAGQ MAICT ^ 

2190 5$1A 
GTCGAWTGAOCGOGCyuUAACACaTOTWATW^ 

^GTATG«T7G6ATTTAA^ 

** w 2310 2330 



TTTGATGTC^ATGGCGATAATAA^CAAAAA^C^ 
ACCGTCG37ACQCCGCAAATCGGCJU^^TCCGCAACGGCAAATAC^MCCW 



2430 



2370 2390 

CCGCC 

GATTTGAAAGACACCT^^ 
CKWCTTTCGTCCCCCACGCTGGIGGAT^ 

GCCGGCG^CCGGGGCGGCAATATGTACWCTTTGATTTGAGCAAITCCGAITCTAGTAAA 

2^70 9£QA 

TGGTCTGCAAAGGTTATTTTCGAAGGCGACAAGCCGA 
CSACTGGCAG^U^CGTCGXCAT^ 

OTACIGAATACGGGCGAACAATAIATWACGGTATCITTGACGACSATA^ 

aataaaacattattckgaacaagagaScgacggttcg^ 

AAATXGAOOOAAOGAGAACGCGTTACCGTCAAACCGACCGTGGTATTGCG 
GTAACCAICCGCAAATATAACGACGG«G^ 

AAIACCGCCGACGGCGGCGCATTGACICTOAGAAGCGCGCGCCCGATTGTO 

AATTCGGTTGCGCAATATTCCGGCCATAA^ACAACCTCCAAAGGCAA^TCCATCCCTATA 
C^TGTATGGACAAAGACGO,AAAAc|wXGCC^ 

GTTAATGTSCGTIATCTGGAIGAAACGGAAACWGACGGATTTTCAACQA^GGCGGACGGC 



GATGCGGGCGGCAGCGGTATAGACCCCGCCGGCAGGCGTCCCGGC AAAAACAACCGCTGC 

ttcicc^aaaaaggggtgcgca^ctgIwatgaacgatt^ 



3330 33S0 

•<«»>,»»! ATASACCC 

3430 VAQGCGT0C6CAC CCT 
CKWCCGAIGTGCGGIATCAAACGCTTWGCTGG^^ 
GCGGCCGGTTTTTCCGCAAAIGCCGTCMA^GGCCr^ 
TTCGGGAGGGGGGCGGCAAATGAAACG 
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10 30 50 

GAXCCGCCCGGTGCXXGGGGCCCXXAGGGAACCGXXCCCXXXGAGCCGGGGCGGGGCAAC 
70 go ' 110 

GCGXACC6GTTXTTGXXAAXCCGCXAXAAAAGGCGGGCXAXAGGGXAGGCXXCAXCCXG 

130 150 170 

CCAATCTCACTOAATCCGTCAATTTCCGCAATTCAATTAAATACCGTCAAACCOATGCCG 

190 210 230 

TCATTCCaCGCAGGCGGGAATCCGGACCGGTCGGGCATCTGCGGCGOTTTGCTAAAAAAC 

250 270 290 

GCXXXACCGXGAXAAGXGCGCAAAGXXAAAAXGGGGAGGXAAGCXXXTCAAXCAGCAATC 

310 330 330 



370 390 410 

AGGCAAACXXAAGGAAXAAAAXAXGAAXAAAACXTTGAAACGGCAGGXXXXCCGCCAXAC 

MNRTX.KRQVFRHX 
430 450 470 

CGCGCTTTAXGCCGCCAXC7XGATGTTXXCGCAXACCGGCGGGGGGGGGGGGGCGAXGGC 
A L Y A A I LMFSHXGGGGGAMA 
490 510 530 

GCAAACCCATCAATACGCTATTATCATGAACGAGCGAAACCAGCCCGAGGTAAAGCAGAA 
Q * H Q Y A X IMNERNQPEVKQN 
550 570 590 

TGXGCCAXCXXCAAXAAAGGACAAAGACAGGAGGCGCGAAXAXACXXATTATACGCACAG 
V *$$IKDKDRRREYXYYTHR 
610 630 650 

AACA6GAGGAGGCTCTGTC7CATTCAACAATAACGATACCCTTGTTTCCCAACAAAGCGG 
TGGG$V5rNN*JDTLVSQQSG 
670 690 710 

TACTGCCGTTXTTGGCACAGCCACCTACCTGCCGCCCTACGGCAAGGTTTCCCGTTTTGA 
TAVFGTATY LPP'XGKVSGIfD 
730 750 770 

TGCCGTCGCTCTGAAAGAGCGCAACAATGCCGXTGATTGGATTCGTACCACCCGCATCGC 
AVALKBRNNAVDWIRTTRIA 
790 010 830 

GCTGGCAGGCTACTCCTACATCGACGTCATATGCAGAAGCTACACAGGCTGTCCCAAACT 
LAGVSY IDVlCRSYTGCPKIi 
850 870 890 

TGTCTATAAAACCCGATTTACCTTCGGTCAACAAGGGTTGAAAAGAAAGGCASGCAGCAA 
VYKTRFrrrGQQGIiKRKAQSK 
910 930 950 

GCXGGAXAXAXACGAAGACAAAAGCCGCGAAAAXXCGCCCAXXTAGAAAXXGXCGGAXTA 
1-DIVEDKSRB NSPirKIiSDY 
970 990 1010 

TCCXTGGXXGGGCGXAXCX77CAAXXXGGGCAGCGAGAAXACCGXCCAAAAIAGCAAAXX 
PWLGVSFNIjGSBKXVQNSKI* 
1030 1050 1070 

AXXCAACAAAXXGAXAXCXXCXXXXAGAGAAGGCAAXAAXAAXCAAACCAXCGXCXCXAC 
rNKLISSTREGNNNQXlVST 

1090 mo 1130 

GACAGAAGGCAACCCXAXXTCCCXXGGCGACCGGCAGCGCGAACAXACCGCCGXGGCCXA 
TEGNPISLGDRQREHXAVAY 
1150 1170 1190 

TXAXCXGAACGCCAAACXGCACCXGCXGGACAAAAAAGGGAXXGAAGAXATCGCCCAAGG 
Y LNAK Lr H L L D KK G I E D IAQG 
1210 1230 1250 

CAAAAXAGXGGATXTGGGXAXC7XGAAACCGCACGXCGAGACGACAGGACGAAGCXXGCX 
K1VDLGILKPHVEXXGRS hi* 
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1270 1290' 1310 

AGA77X77GGGC7AGG7GGGACA77AAAGA7ACCGGGCAGA77CCGG7CAAGC7CGGCC7 
DFWARWDXKDTGQIPVKLGL 

1330 1350 1370 

GCCGCAAG7CAAAGCAGGCCGC7GCACCAACAAACCGAACCCCAA7AA7AA7ACGAAAGC 
PQVKAGRC7NKPNPNNNTKA 

1390 1410 1«0 

CCCTTCGCCGaCACTGACCGCCCCCGCGCTGTGGTTCGGACCCGGGCAAGATGGTAAGGC 
PSPAL7APALWFGPG0DGKA 

1450 1470 1490 

GGAGATGTATTCCSCTTCGGTTTCCACCTACCCCGACAGTTCGAGCAGCCGCATCTTCCT 
EMVSASV5TYPDSSS6RIFL 

1510 1530 1550 

CCAAGAGC7GAAAAC7CAAAGCGAACCCGGCAAAGCCGGCCGC7AT7CCC7CAAA7C777 

QELKTQTEPGKPGHrSLK SL 

1570 1590 1610 

GAA7GA7GG7GAGA7TAAAAG7CGACAGCCGAG777CAACGGGCGGCAAACAA7CA7CCG 
NDGEXK3R0P8FNORQ7XXR 

1630 1650 1670 

ATTOGATGACGGCGTACATTXOATCAAACTOAATGGAAGCAAGGATaAGaTCGCCGCTTT 
LOD -GVHLIKLNG5 KDEVAAF 
1690 1710 1730 

707CAA777AAA7G0AAAGAACACCGGCAAAAAC0ACAC777CGGCA77G77AAGGAAGC 
VNLN GNN 7GKNDTFGIVKEA 

1750 1770 1790 

GAACG7CAA7C77GACGCCGACGAG7GGAAAAAAG7GCXGC7GCC770GACGG77CGGGG 
NVNLDADEWKKVLLPW7VRG 

1810 1830 1050 

7CCCGA7AA7GACAA7AAA77TAAA7CAA77AACCAAAAACCAGAAAAA7ACAGCCAAAG 
PDNDMKFK3XNQKPEKYSQR 

1070 1890 1910 

A7ACCGCA7CCGCGACAACAACGGCAA7CGCGA777GGGCGACA7CG7CAACAGCCCGA7 

VR X RDNNGNRDXjGD X V N 5 P X 
1930 1950 1970 

7G7CGCGG7CGGCGGG7A7T7GGCAACCGCCGCGAACGACGGGA7GG7GCA7ATC7TCAA 
VAVGGTLA7AANDGMVH I F K- 

1990 2010 2030 

AAAAAACGGCGGCAG7GA7GAACGCAGC7ACAA7C7GAAGC7CAGC7ACA7CCCCGGCAC 
KNGGSDERSYNIiKfcSYIPGT 

2050 2070 2090 

GA7GCCGCGCAAGGATA77CAAAGCCAAGAA7CCACCC77GCCAAAGAGCTGCGCGCCT7 

MPRKDXQSQE3TLAKEI*RAr 

2110 2130 2150 

7GCCGAAAAAGGC7A7GTGGGCGACCGC7ACGGCG7GGACGGCGGC7T7GXC77GCGCCA 

AEKGVVGDRTGVDGGFVLRQ 

2170 2190 2210 

AG7CGAAC7GAGCGGGCAAAAACACG7G7T7A7G77CGGCGCGA7GGG7777GGCGGCAG 

VE IiSGQKHVFMFGAMGFGGR 
2230 2250 2270 

GGGCGCG7A7GCC7XGGA777AAGCAAAA7CAACGGAAA77A7CCGGCCGCCGCCCCCCX 

GAYALD LSK INGNYP AAAP I* 
2290 2310 2330 

G777GA7GTCAAAGA7GGCGA7AAXAACGGCAAAAA7CGCG7GAAAG7GGAAT7AGGC7A 
FDVKDGDNNGKNRVKVKLGr 

2350 2370 2390 

CACCG7CGG7ACGCCGCAAA7CGGCAAAAXCCGCAACGGCAAA7ACGCCGCC77CC , rCGC 
TVG7PQXGKXRNGKVAAFI-A 

2410 2430 2450 

C7CCGG77A7GCGGC7AAAAAAA7XGACGAC7CAACAAA7AAAACCGCGC7G7A7G7A7A 
SGVAAKKIDDST. NKXAIiYVt 

2470 2490 2510 

7GA777GAAAGACACC77AGGTACGCCGA77GCAAAAA7CGAAG7GAAGGACGGCAAAGG 
DLKDTLGTP XAKIEVKDGKG 
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2330 2550 2370 

CGGGCTTTCGTCCKCACGCTGGTGGATAAAGATTTG3ACGGCACG3TCGATATCGCCTA 
OLSSPTLVOKOliDOT V D X A Y 
2590 2610 2630 

T0CC0OC0ACCOOaaC8OCAATAT0IACCQCtT»(»TTTejlGCaXTTCC(»TTCTJUiXJUl 
A0DR08MMVRFDI.SNSDS6K 
2650 2670 2690 

ATOOTCTGCJUUkGeWATTTTCeAWSCCeJlCAAGCCOIlTTACCTCCOCaCCCaCCaTTTC 
W&AKVXPSGDKP XT6APAVS 
2710 2730 2730 

CCOACTGOCAGACAAACGCGTCGTCATCTTCGGTACGGGCAGCGATTTGACCGAAaATGA 
RLADKRVVIPGTG6DLTEDD 
2770 2790 2810 

TGTACTGAATACGGGCGAACAATATATTTACGGl'ATCTTCGACGACGATAAGGSGACGGT 

2830 2830 2870 

TAAGGTAACGGTACAAAACGGCACG6CAGGCGGGCTGCTCGAGCAACACCTTACTCAGGA 
V .L- V QNOTAGGliX,EQHLTQE 
2B90 2910 2930 

AAATAAAACA7TATTCCTGAACAASAGATCCGACSG3XCGGGCAGCAAGGGCTGGGCGGT 
NK * LrLMK *SDGSGSlCOWAV 
2930 2970 2990 

GAAATTOAOOGAAOGAGAACGCGTTACCGTCAAACCCACCGTGGTATTGCGTACCGCCTT 
KLR&GERVTVXPTVVLRTAF 
3010 3030 3050 

COTAACCATCCGCAAATATAACGACGGCGGCTGCGGCGCGGAAACCGCCATTT7GGGCAT 
VIIRKVNDGGCGAETA2LGI 
3 "° 30}o 3110 

CAATACCGCCGACGGCGGCGCATTGACTCC8AGAAGC6CGCGCCCGA9TGTGCCGGATCA 
T . A .- D 0OA »'TPRSARPtVPDH 
3130 3150 3170 

,Y.« A 0lf8 OHKTTSKGXStPI 
3190 32i(j 3230 

AGQTTGTATGGACAAAGACGQTAAAACCGTCTGCCCGAACGGATATGTTTACGACAAGCC 
C »»-A° KDGK *VCPNG*VYDKP 
3250 3270 3290 

OOTCAATGTGCGTTATCTGGATGAAACGGAAACAGACGGATTTTCAACaACGGCGOACGG 
™ R VLD BTBIDGFSTTADG 
3310 3330 3350 

COATGCOGGCGGCAGCGGTATAGACCCCGCCGGCAGGCGTCCCGGCAAAAACAACCGCTG 

3370 3390 3410 

CTTCTCCAAAAAAGGGGTGC6CACCCTGCTGATGAACGATTTGGACAGCTTGGATATTAC 
6VRT *»fcMNDLD3t.DXT 
3430 3450 3470 

CGGCCCOATGTGCGOTATCAAACGCTTAAGCTGGCGCGAAGTCtTCTTCTGACCGGCCTG. 
A. C 0IK *I-3WREVFr* 
90 3310 3530 

CGCGGCCGGTTTTTCCGCAAATGCCGTCCGAAAGGCCTTCGGACGGCATTTTT77GCGTT 

3550 

MTCOGGAGGGGGGCGGCAAATGAAACO 
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10 30 SO 

OATCCGCCCQGTGCTTGGGGCCCTTAOOGAACC G IT CCC TTTQAOCCOOQGCGGGGCAAC 

70 90 110 

GCGTACCGGTTTTTGTTAATCCGCTATAAAAGGCGGGCTATAGGGTJtfKiCTTCATCCTG 

130 ISO 170 

CCAATCTCACTGWlTCCGTCAATTTCCGCAATTCAATTAAATACCGTCiUlACCGATGCCG 

190 210 230 

TCATTCCGCGCAGGCGGGAATCCGGACCGGTCGGGCATCTGCGGCGGTTTGCTAAAAAAC 

250 270 290 

GCTTTACCGTGATAAGTGCGCAAAGTTAAAATGGGGAGGTAAGCTTTTCAATCAGCAATC 

310 330 350 

CGGCGGGCGCGGAATCGGGCGGTTTACCGAACCCCGGCGTTCGCGGCGCCCGTCCCGCGA 

370 390 410 

AGOCAAACTTAAGGAATAAAATATGAATAAAACTTTGAAACGGCAGGXTTTCCGCCATAC 
t SIM 1 MKKTLKRQVFRHT 
430 450 470 

CGCGCTTTATGCCGCCATCTTGKTGTTTTCGCATACCGGCGGGG<^GGGGGGCGATGGCG 
A L Y A A I tiHFSHTGGGGGRWR 

ftltti 2 M A 
490 510 530 

CAAACCCATCAATACGCTATTATCATGAACGAGCGAAACCAGCCCGAGGTAAAQCASAAT 

KPINTLL6* 
QTHQYAt XMNERNQPEVKQN 

550 570 590 

GTGCCATCTTCAATAAAGGACAAAGACAGGAGGCGCGAATATACTTACTATACGCACACiA 
VP8 8 XKDKDRRREXTYYTHR 

630 630 650 

ACAGGAGSAGGCTCTGTCTCATTCAACAATAACGATACCCTTGTTTCCCAACAAAGCGGT 
T0GG8VSFHNNDTLVSQQ6G 

6^0 690 710 

ACTGCCGTTTTTGGCACAGCCACCTACCTGCCGCCCTACGGCAAGOTTTCCGGTTTTGAT 
TAVrGTATYLPPrOKVSGKO 

730 750 770 

GCCGTCGCXCTGAAAGAGCGCAACAATGCCGTTGATXGQATTCGTACCACC5CGCATCGCG 
AVAtiKERNNAVDWXRTTRIA 

790 010 830 

CTGGCAGGCXACTCCTACATCGACGTCATATGCAGAAGCTACACAGGCTGTCCCAAACTT 
LAGYS VIOVICRSYTGCPKL 

050 670 690 

GTCTATAAAACCCGATTTACC7TCGGTCAACAAGGGTTGAAAAGAAAGGCAGGCAGCAAG 
VYKTRFTrGQQGLKRKAGSK 

910 930 9S0 

CTGGATATATACGAAGACAAAAGCCGCGAAAATTCGCCCATTTACAAATTGTCGGATTAT 
LOtYEDKSRCKSPIYKLSDY 

970 »90 1010 

CCTTGGTTGGGCGTATCTTTCAATT'XGGGCAGCGAGAATACCGTCCAAAATAGCAAATTA 
P W L G VS F KXiGS EN7VQM S KL 

3 030 1030 1070 

TTCAACAAATTGATATCTTCTTTTAGAGAAGGCAATAATAATCAAACCATCGTCTCTACG 
T U K U I SSFREGNNNQTIVST 

1090 mo U30 

ACAGAAG3CAACCCXATTTCCCTTGGCGACCGGCAGCGCGAACATACCGCCGTGGCCTAT 
TEGNP I3LGDRQREHTAVAY 

H50 H70 . 1190 

TATCTGAACOCCAAACTGCACCTGCTGGACAAAAAAGGGATTGAAGAXATCGCCCAAGGC 
VLNAKIiKLXiDKKGIEDt AQG 

1210 1230 1230 

AAAATAGTGGATTTGG3TATCTTGAAACCGCACGTCGAGACGACAGGACGAAGCTTGCTA 
KIVDLGZLKPHVETTGRSZiL 

1270 1290 1310 

GATTTTTGGGCTAGGTGGGACAWAAAGATACCGGGCAGATTCCGGTCAAGCTCGGCCTG 
DFWARWD IKOTGQ I P V K L G L. 
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1330 13S0 1370 

CCGC»AG7CAAAGCAGGCCGC?GCACCAACAAACCGAACCCCAAT 
PQVKAGRCTNKPNPNNN7KA 

1390 1410 1430 

CCTTCGCCGGCACTGACCGCCCCCGCGCTGTGGTTCGGACCCGGGCAAGATGGTAAGGCG 
PSPALTAPALWFGPGQDGKA 

1450 1470 1490 

GAGATGTATTCCGCTTCGGTTTCCACCTACCCCGACAGTTCGAGCAGCCGCATCX7CCTC 
EMVSA3VSTYPD33SSRIFL 

1510 1530 1550 

CAAGAGCTGAAAACTCAAACCGAACCCGGCAAACCCGGCCGCTATTCCCTCAAATCTTTG 
QELK7Q7EPGKPGR¥SfcK$L 

1570 1S90 1610 

AATOATOaiaAGATTAAAAGTCGACAGCCGAGTTTCAACGGGCGGCAAACAATCATCCGA 
NDGEXKSHQPSFNGRQTIIR 

1630 1650 1670 

TTGGATGACGaCGTACATTTGATCAAACTGAATGGAAGCAAGGATGAGGTCGCCGCTTTT 
LDOGVHlilKLNGSKDEVAAF 

1690 1710 1730 

OTCAATTTAAATGGAAACAACACCGGCAAAAACGACACTTTCGGCATTGTTAAGGAAGCG 
VNLNGNNTGKNDTFGIVKEA 

1750 X770 1790 

AACGTCAATCTTGACGCCGACGAGTGGAAAAAAGTGCTGCTGCCTTGGACGGTTCGGGGT 
NVNLDADEWKKVLLPWTVRG 

1810 1830 1830 

CCCGATAATGACAATAAA777AAATCAATTAACCAAAAACCAGAAAAA7ACAGCCAAAGA 
PDNDNKFKSZNQKPEKYSQR 

1870 1890 1910 

TACCGCA7CCGCGACAACAACGGCAATCGCGATTTGGGCGACA7CG7CAACAGCCCGATT 
YRZRDKNGNRDIiGDZVNSP I 

1930 1930 1970 

G7CGCGG7C0GCGGGTA777GGCAACCGCCGCGAACGACG0GA7GG7GCA7A7C7TCAAA 

VAVGGVLA7AANDGMVH IFK 

1990 2010 2030 

AAAAACGGCGGCAGTGA7GAACGCAGC7ACAA7CTGAAGCTCAGCTACA7CCCCGGCACG 
KKQG8D&R8YNLKL5YZPG7 

2050 2070 2090 

ATGCCGCGCAAGGATA77CAAAGCCAAGAA7CCACCC77GCCAAAaAGC7GCGCGCCT7T 
MPRKD I Q SQ&S TLAKELRAF 

2110 2130 2130 

GCCGAAAAAGGC7ATG7GGGCGACCGC7ACGGCG7GGACGGCGGCT77G7C77GCGCCAA 

AEKGY VGDRYGVDGGFVIiRQ 

2170 2190 2210 

G7CGAACTGAGCGGGCAAAAACACGTG7X7A7GX7CGGCGCGATGGG7TT7GGCGGCAGG 

V E L 8 GQKHVFMFGAMGFGGR 

2230 2230 2270 

GGCGCGTATGCCTTaGATTTAAGCAAAATCAACGGAAA r rrATCCGGCCGCCGCCCCCCTG 
G A Y A h ' DLSKINGNYPAAAPli 

2290 2310 2330 

TTTGATG7CAAAGATGGCGATAATAACGGCAAAAA7CGCGTGAAAG7GGAATXAGGCTAC 
FDVKDGDNNGKNRV .KVEliGY 

2350 2370 2390 

ACCGTCGG7ACGCCGCAAATCGGCAAAA7CCGCAACGGCAAATACGCCGCCTTCCTCGCC 
TVG7PQ XGKXRNGKYAAFXjA 

2410 2430 2430 

TCCGGTTATGCGGCTAAAAAAATTGACGAC7CAACAAA7AAAACCGCGC7GTATGTATAT 

3GYAAKKIDDS 7NK7AIiYVY 

2470 2490 2510 

GA77TGAAAGACACCTTAGGTACGCCGATTGCAAAAATCGAAGTQAAGGACGGCAAAGGC 
DIiKDTLGTPIAKIEVKDGKG 

2530 2350 2570 

GGGC777CG7CCCCCACGC7GG7GGATAAAGA77XUGACGGCACGG7CGA7A7CGCCTAT 
GZ.8SP7LVDKDLOG7VPZAY 
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2590 2610 2630 

CCC0SC3ACCa«S6OCO0CAATATQTACCQCITT0ATTTOAOCAATTCCSATTCTAOTAAA 
A60R0GNMVRF0 L8N8DSSK 

2650 J670 2 690 

iGaiCTOCAAAGGTTATTTTCOAAaOCOACAAGCCSATTACCTCCSCOCCCOCCOTITCC 
W 8 VIFB GDKPITSAPAVS 

2710 2730 2750 

CGACTGGCAGACAAACOCOZCGTCATCTTCGSTACGQaCAQCGATTTQACCBAAGATOAT 
RfcAPKRVVXFGTGSDLTEDD 

2770 2790 2810 

GTACTGAATACGG8CGAACAATATATTTACGGTATCTTTGACGACGATAAGGGGACG6TT 

V L N T G E Q ¥ tVGIFDODKGIV 

2850 - 28S0 2870 

AAGGTAACGGTACAAAACGGCACGSCAGGCGGGCTGCTCGAGCAACACCTTACTCAGGAA 
K V ONGTAGGtLEQHLTQE 

28*0 2910 29)0 

AATAAAACAtTAOTCCTGAACAAOAGAICCGACGGrrCOGGCAGCAAGQGCTGGeCGGTG 

K tLHKRS B<3SGS KGWAV 

2950 2970 2990 

AAATTOAGBGAAGOAGAACGCGTTACCOTCAAACCGACCGTGGTATTGCGTACCGCCTTC 
L GBRVTV KP TVVIiRTAP 

3010 3030 3050 

GTAACCATCCGCAAATATAACOACGGCGGCTQCGGCGCGGAAACCQCCATTTIGGGCATC 
KYMD «QC GAETAILGI 
3070 3090 3110 

N * gqaltprsarp IVPDH 

3130 3130 3170 

AATTCOGTTGCGCAATAtTCCGGCCATAAOACAACCTCCAAAGGCAAATCCATCCCXATA 
QrS6 HKTia KG K5IPI 

3190 3 210 3230 

GGTTGXATGGACAAAGACGGTAAAACCGTCTGCCCGAACGGAIATGOTIACaACAAGCCG 

KD <SKTVCPNG*VYDKP 
32 50 3270 3290 

OTTAATGTGCGTTATWCGATGAAACGGAAACAGACGGATTTTC^ 

V N V R VLDETETDGFSTTADG 

3310 3330 3380 

GATGCGGGCGGCAGCGGTATAGACCCCGCCGGCAGGCGTCCCGGOAAAAACAACOiCTGC 
SOI °PAORRPGKNKRC 
3370 3390 3410 

TTCTC«AAAAAGGGGTGCGCACCCTGCIGATGAACGATTTGGACAGCTIGGATATTACC 
6VRTI 'I'MMDI,DSIiDIT 
3430 3430 3470 

GGCCCGAT0T0C0GTATCAAACGCTTAAGCTGGC6CGAAGICTTCTTCTGACCGGCCTGC 

0IKR I-fiWRBVFr* 
349 0 3910 3H0 

OCGGCCGGTITTTCCGCAAAIGCCBTCCGAAAGGCCXTCGGACGGCATTTTTTIGCGTTT 
TTCGGGAG0GGGGCGGCAAA7GAAACG 
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1 CCGCTGTATGTGTATGATTTGQAAAACACCAGTGCTACn'CTGATTAAAAAAATCGAAGCA 
CCCGGCGGCAAAGGCGGGCTTTCGTCCCCCACGCTGGTGGATAAAGATTTGGACGGCACG 
QTCGATATCGCCTATGCCGGCGACCGGGGCGGCAATATGTACCGCTTTGATTTGAGCAAT 
TCCGATTCTAGTAAATGGTC7GCAAAGGTTATTTTCGAAGGCGACAAGCCGATTACCTCC 
GCGCCCGCCGTTTCCCGACTGGCAGACAAACGCGTGGTTATCTTCGGCACGGGCAGCGAT 
TTGAGTGAACAGGATGTACTGGATACGGACAAACAATATATTTACGGTATCTTTGACGAC 
GATAAGTCGACGGTTAATGTAAAGGTAACAAACGGCACGGGAGGCGGGCTGCTCGAGCAA 
GTGCTTAAAGAGGAAAGTAAAACCTTATTCCTGAGCAATAATAAGGCATCCGGCG3ATCG 
GCCGATAAAGGGTGGGTAGTGAAATTGAGGGAAGGAGAACGCGTTACCGTCAAACCGACC 
GTGGTATTGCGTACCGCCTTTGTCACCATCCGCAAATATACGGATACGGACAAATGTGGC 
GCGCAAACCGCCATTTTGGGCATCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAUC 
GCGCGCCCGATTGTGCCGGATCACAATTCGGTTGCGCAATATTCCGGCCATCAGAAAATG 
AACGGCAAGXCCATCCCGG 7 3 $ 
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PLYVYDLENTSGSLIKKIEA 
1 CCGCTGTATGTGTATGATTTGGAAAACACCAGTGGTAGTCTGATTAAAAAAATCGAAGCA 

. t • • ♦ 

PGGKGGLSSPTLVDKDtiDGT 

CCCGGCGGCAAAGGCGGGCTTTCGTCCCCCACGCTGGTGGATAAAGATTTGGACGGCACG 

V D IAYAGDRGGHM*R F D I* S N 
GTCGATATCGCCTATGCCGGCGACCGGGGCGGCAATATGTACCGCTTTGATTTOAGCAAT 

§••••• 
SDSSKTfSAKVXF&G. DECPXTS 
TCCGAWCTAGTAAATGGTCTGCAAAGGTTATTTTCGAAGGCGACAAGCCGATTACCTCC 

• • • • • • 
APAVSRfcADKRVVIFGTGSD 
GCGCCCGCCGTTTCCCGACTGGCAGACAAACGCGTGGTTATCTTCGGCACGGGCAGCGAT 

• ••••• 
LSEQOVLDTDKQYIYaiFDD 

TTGAGTGAACAGGATGTACTGGATACGGACAAACAATA^AXTTACGCTATCTTTGACGAC 

• • • « • • 

DKSTVNVKVTMGTGGGLtiEQ 

GATAAGTCGACGGTTAATGTAAAGGTAACAAACGGCACGGGAGGCGGGCTGCTCGAGCAA 

• • • • » • 
VLKEBSKTLFLSNNKASGGS 
GTGCTTAAAGAGGAAAGTAAAACCTTATTCCTGAGCAATAATAAGGCATCCGGCGGATCG 

• ••••• 
ADKGWVVKLREGERVTVKPT 
GCCGATAAAGGG7GGGTAG7GAAATTGAGGGAAGGAGAACGCGTTACCGTCAAACCGACC 

• • • • ■ • 
VVLRTAFVTIRKVTDTDKCG 
GTOGTATTGCGTACCGCCTTTGTCACCATCCGCAAATATACGGATACGGACAAATGTGGC 



AQTAILGlNTADGGAlaTPRS 
GCGCAAACCGCCATTTTGGGCATCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGC 



A R P IVPDHNSVAQYSGHQKM 
GCGCGCCCGATTGTGCCGGATCACAATTCGGXTGCGCAATATTCCGGCCATCAGAAAATG. 

M G K 6 I P 
AACGGCAAGTCCATCCCGG 739 
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POGKGGt' SSP TX.VDKD L P G T 
CCCOOCGOCAAAGGCGaQCTTTCGTCCCeXACGCTGGT^T^^ 

^11 1 1 I 1 1 I M II I 1 1 1 1 1 I 1 1 1 I II 1 1 1 M I 1 1 1 1 1 1 M I 1 1 I I II 1 1 M M I 1 1 I 

M^ACGGC^AGGCGOaCTTTCGTCrcCC^^ 

KftGKGGl.SSPTLVDlCOI.OGT 

VDIATAGDRGGMMrRrDIiSN 
GTCGAtATCCCCTATGCCGGCGACCGGGGCGGCiATATGTACCGCTTTG&OTW^AAT 

I 1 1 1 I I I I I 1 1 I I I I I 1 1 I I I I I M 1 1 II 1 1 I I 1 1 I I I I I I I I I I 1 1 1 1 I < I I I 1111 
GTCTATA7CGCCTATGCCGGCGACCGGGGCGGCJUITATGTACCGCTTTGATTTGAGCAAT 

VDXAXAGDRGGNMVRFDLSN 

SDSfiKWSAKViyBSDK P I * S 
TCCGAITCTAGTAAATGGWT^AAAGGTTATTTTCGAAGGCGACAAGCCGATTACCTCC 
I 1 1 1 1 1 I I I I I I I I I I I I I I I I I 1 1 I I I I I III I 1 1 I 1 I I I I I 1 1 I I 1 1 I I 1 I I I I I I I I 
TCCGATTCTAGTAAATGGTCTGCAAAGGTTATTTTCGAA66CGACAAGCCGATTACCTCC 
3DSSKWSAKVirEGDKPIT8 

• • * * 

APAVSRIi AOK RVVIPOTOSD 

GCGCCCGCCGTTTCXC6ACTGGCAGACAAACGCGTGGTTATCTTCGGCACGGGCAGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii in nil ' 1 JLjLJL JULIJUULii 

GC6CCCG CC GII SC CC SACTCGCAOACAAACOCGTCGTC&TCTTCGGTACGGGCAGCGAT 
APAV3RfcADKRVVXfGTG3D 

L8EQDVIiDTOKQri^OtrO 0 
TTGAOTGAACAGGATGTACTGGATACGGACAAACAATATATTTACGGTATCOTT^ACGAC 

mi in i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i tiiiiiiiiiiiiniiiiiniiiiiii 

TTGACCOAAGATGATGTACTGAATACGGGCBAACAATATATTTACGOTATCTTTGACGAC 
LZSaOVLHTfiEQ^ ITOtrDD 

^ k • 

okstvmvkvtngtoggl IvJLj* 

GATAAGTCGACGQTSAATGTAAAGGTAACAAACGGCACGGGAGGCGG6CTGCTOTAGCAA 

llllll tllllllll Nil III! Illllllllll 1 1 1 1 1 1 1 1 I ! 1 1 1 M 1 1 1 1 
GATAAG6G8ACGGTIAAGGTAACGGTACAAAACGSCACGGCAGGCGGGCTGCTCGAGCAA 

DKfiTVJSV2VfiNGTAG01' I ' BQ 

V L K E E 3 K* Tbr*&SMMXA LJ^^L,. 
GTOCTTI^flAGGAIJkOTAAAAlWTTATTCCTGAGCAATAATAAGGCATCCGGCG^TCO 

"u 111,11 mm nun" 1 11 iiiilJiLiii 

CACCTTACTCAGGAAAKTAAAACATTATTCCTG AACAAGAGATCCGACGGTTCO 

flLiaBUKTLrL NKfiSaGS 

ADKGWVVKX.R8GBRVTVK P T 
GCCGATAAAGGGTGGGTAGTGAJUTTGAGGGAAGGAGAACGCGTTACOTTOlAACCGAtC 

ii ii ii mi 1 1 1 1 1 1 n 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 m i 1 *JL illJJLJLIJLJL 

GGCAGCAAGGGCTGGGCGGTGAAATTGAGGGAAGGAGAACGCGTTACCGTCAAACCOAV,^ 
GSKGWAVKI.RBGSRVTVKPT 
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VVLRTAFVT XRKYTDTDKCG 
GTGGTATTGCGTACCGCCTTTGTCACCATCCGCAAATATACGGATACGGACAAATGTGK 

f 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 it it ii in 

GTGGTATTGCGTACCGCCTTCGTAACCATCCGC AAATATAACG A * . . CGGCGGCTGCGGC 
VVLHTAr;VTZRKYKD (ifiCG 

AQTAILGINT ADGGALTPRS 
GCGCAAACCGCCATTTTOGGCAICAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGC 
Ml >llli III M lUillliMtilllllll [| I lllllll MM llllllll Mill 
GCGGAAACCGCCATTTTGGGCATCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGC 
AfiTAILGlNTADGGALTPRS 

A R P XVPDHNSVAQYSGH* QKM 
GCGCGCCeGATTGTGCCGGATCACAAXTCGGTTGCGCAATATTCCGGCCATCAGAAAA . 
IMIIIMMI I || miMIIMINIilMI III II IIMIIIII MM III II 
GCGCGCCCGATTGTGCCGGATCACAAWCGGTTGCGCAATATTCCGGCCATAAGACAAC 
A R P IVPDHNSVAQrSGKltll 

N G K S I P 
. . TGAACGGCAAGTCCATCCCGG 739 

MM Mill llllllll 
CTCCAAAGGCAAATCCATCCCAT 3176 
& X G K S I P 
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